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COMPOSITIONS FOR THE TREATODBNT AMD DIAGNOSIS 
OF BODY WEIGHT DISORDERS. IMCI^TlPTyg oaigp TTV 

1. INTRODUCTION 

The present invention relates to the mammalian tubby 
(tub) genes, including the human tub gene, which are novel 
genes involved in the control of mammalian body weight, 
including recombinant DNA molecules, cloned genes or 
degenerate variants thereof. The present invention further 
relates to novel mammalian, including human, tub gene 
products and to antibodies directed against such mammalian 
£ub gene products, or conserved variants or fragments 
thereof. The present invention also includes cloning vectors 
containing mammalian tub gene molecules, and hosts which have 
been transformed with such molecules. In addition, the 
present invention presents methods for the diagnostic 
evaluation and prognosis of mammalian body weight disorders, 
including obesity, cachexia and anorexia, and for the 
identification of subjects exhibiting a predisposition to 
such conditions. Further, methods and compositions are 
presented for the treatment of mammalian body weight 
disorders, including obesity, cachexia and anorexia. Still 
further, the present invention relates to methods for the use 
of the mammalian tub gene and/or mammalian tub gene products 
for the identification of compounds which modulate the 
expression of the mammalian tub gene and/or the activity of 
the mammalian tub gene products. Such compounds can be used 
as therapeutic agents in the treatment of mammalian body 
weight disorders, including obesity, cachexia and anorexia. 

2 . BACKGROUND OF THE INVENTION 
Obesity represents the most prevalent of body weight 
disorders, and it is the most important nutritional disorder 
in the western world, with estimates of its prevalence 
ranging from 30% to 50% within the middle-aged population. 
Other body weight disorders, such as anorexia nervosa and 
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bulimia nervosa which together affect approximately 0.2% of 
the female population of the western world, also pose serious 
health threats. Further, such disorders as anorexia and 
cachexia (wasting) are also prominent features of other 
5 diseases such as cancer, cystic fibrosis, and AIDS. 

Obesity, defined as an excess of body fat relative to 
lean body mass, also contributes to other diseases. For 
example, this disorder is responsible for increased 
incidences of diseases such as coronary artery disease, 

10 hypertension, stroke, diabetes, hyper lipidaemia and some 
cancers. (See, e.g. , Nishina, P.M. et al . , 1994, Metab. 
43:554-558; Grundy, S.M. &Barnett, J. P., 1990, Dis. Men. 
16:641-731) Obesity is not merely a behavioral problem, 
i -e. , the result of voluntary hyperphagia. Rather, the 

15 differential body composition observed between obese and 
normal subjects results from differences in both metabolism 
and neurologic/metabolic interactions. These differences 
seem to be, to some extent, due to differences in gene 
expression, and/or level of gene products or activity 

20 (Friedman, J.M. et al., 1991, Mammalian Gene 1:130-144). 

The epidemiology of obesity strongly shows that the 
disorder exhibits inherited characteristics (Stunkard, 1990, 
N. Eng. J. Med. 322:1483). Moll et al . have reported that, 
in many populations, obesity seems to be controlled by a few 

25 genetic loci (Moll et al. 1991, Am. J. Hum. Gen. 41:1243). 
In addition, human twin studies strongly suggest a 
substantial genetic basis in the control of body weight, with 
estimates of heritability of 80-90% (Simopoulos, A. P. & 
Childs B., eds., 1989, in "Genetic Variation and Nutrition in 

30 Obesity", World Review of Nutrition and Diabetes £1, S. 
Karger, Basel, Switzerland; Borjeson, M., 1976, Acta. 
Paediatr. Scand. 1^:279-287) , 

Studies of non-obese persons who deliberately attempted 
to gain weight by systematically over-eating were found to be 

35 more resistant to such weight gain and able to maintain an 
elevated weight only by very high caloric intake. In . 
contrast, spontaneously obese individuals are able to 
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maintain their status with normal or only moderately elevated 
caloric intake. In addition, it is a commonplace experience 
in animal husbandry that different strains of swine, cattle, 
etc., have different predispositions to obesity. Studies of 
5 the genetics of human obesity and of models of animal obesity 
demonstrate that obesity results from complex defective 
regulation of both food intake, food induced energy 
expenditure and of the balance between lipid and lean body 
anabolism. 

10 There are a number of genetic diseases in man and other 

species which feature obesity among their more prominent 
symptoms, along with, frequently, dysmorphic features and 
mental retardation. For example, Prader-Willi syndrome (PWS; 
reviewed in Knoll, J.H. et al., 1993, Am. J. Med. Genet. 

15 46:2-6) affects approximately 1 in 20,000 live births, and 
involves poor neonatal muscle tone, facial and genital 
deformities, and generally obesity. 

In addition to PWS, many other pleiotropic syndromes 
which include obesity as a symptom have been characterized. 

20 These syndromes are more genetically straightforward, and 

appear to involve autosomal recessive alleles. The diseases, 
which include, among others, Ahlstroem, Carpenter, Bardet- 
Biedl, Cohen, and Morgagni- Stewart -Monel Syndromes. 

A number of models exist for the study of obesity (see, 

25 e.g. , Bray, G.A. , 1992, Prog. Brain Res. 93:333-341, and 
Bray, G.A. , 1989, Amer. J. Clin. Nutr. 5:891-902). For 
example, animals having mutations which lead to syndromes 
that include obesity symptoms have also been identified. 
Attempts haye been made to utilize such animals as models for 

30 the study of obesity, and the best studied animal models, to 
date, for genetic obesity are mice. For reviews, see e.g. . 
Friedman, J.M. et al., 1991, Mamm. Gen. 1:130-144; Friedman, 
J.M. and Liebel, R.L., 1992, Cell £1:217-220,) 

Studies utilizing mice have confirmed that obesity is a 

35 very complex trait with a high degree of heritability . 

Mutations at a number of loci have been identified which lead 
to obese phenotypes. These include the autosomal recessive 
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mutations obese (fife) , diabetes (db) , fat (£a£) and tubby 
(tub) . In addition, the autosomal domiriant mutations Yellow 
at the agouti locus and Adipose (Ad) have been shown to 
contribute to an obese phenotype. 
5 The fib and db mutations are on chromosomes 6 and 4, 

respectively, but lead to clinically similar pictures of 
obesity, evident starting at about one month of age, which 
include hyperphagia, severe abnormalities in glucose and 
insulin metabolism, very poor thermoregulation and non- 
10 shivering thermogenesis, and extreme torpor and 
underdevelopment of the lean body mass. 

The ob gene and its human homologue have recently been 
cloned (Zhang, Y. et al . , 1994, Nature 172:425-432) . The 
gene appears to produce a 4.5 kb adipose tissue messenger RNA 
15 which contains a 167 amino acid open reading frame. The 
predicted amino acid sequence of the ob gene product 
indicates that it is a secreted protein and may, therefore, 
play a role as part of a signalling pathway from adipose 
tissue which may serve to regulate some aspect of body fat 
20 deposition. 

The db locus encodes a high affinity receptor for the fib 
gene product (Chen, H. et al . , Cell 84:491-495). The db gene 
product is a single membrane -spanning receptor most closely 
related to the gpl3 0 cytokine receptor signal transducing 

25 component (Tartaglia, L.A. et al . , 1995, Cell 83:1263-1271). 

Homozygous mutations at either the fat or tub loci cause 
obesity which develops more slowly than that observed in ob 
and ^ mice (Coleman, D.L,, and Eicher, E.M., 1990, J. 
Heredity 81:424-427) , with tub obesity developing slower than 

30 that observed in fat animals. This feature of the tub obese 
phenotype makes the development of tub obese phenotype 
closest in resemblance to the manner in which obesity 
develops in humans. Even so, however, the obese phenotype 
within such animals can be characterized as massive in that 

35 animals eventually attain body weights which are nearly two 
times the average weight seen in normal mice, tub / tub mice 
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develop insulin resistance with their weight gain but do not 
progress to overt diabetes. 

In addition to obesity, retinal defects, hearing loss 
and infertility have all been observed in tub mice 
5 (Heckenlively, , 1988, in Retinitis Pigmentosa, Heckenlively, 
• ed., Lippincott, Philadelphia, pp. 221-235; Coleman, D.L. & 
Eicher, E.M., 1990, J. Hered. 81:424-427; Ohlemiller, K.K. et 
al., 1995, Neuroreport 6;845-849). Several human syndromes 
exist in which such defects are found to co-exist with an 

10 obesity phenotype, including Bardet^Biedl syndrome, Ahlstroem 
syndrome, polycystic ovarian disease and Usher's syndrome. 

The fat mutation has been mapped to mouse chromosome 8, 
while the tub mutation has been mapped to mouse chromosome 7. 
According to Naggert et al , , the fat mutation has recently 

15 been identified (Naggert, J.K., et al . , 1995, Nature Genetics 
10:135-141). Specifically, the fat mutation appears to be a 
mutation within the Cpe locus > which encodes the 
carboxypeptidase (Cpe) E protein, Cpe is an exopeptidase 
involved in the processing of prohormones, including 

20 proinsulin. 

The dominant Yellow mutation at the agouti locus, causes 
a pleiotropic syndrome which causes moderate adult onset 
obesity, a yellow coat color, and a high incidence of tumor 
formation (Herberg, L. and Coleman, D.L., 1977, Metabolism 

25 2£:59), and an abnormal anatomic distribution of body fat 
(Coleman, D.L., 1978, Diabetologia 14 : 141-148) . This 
mutation may represent the only known example of a 
pleiotropic mutation that causes an increase, rather than a 
decrease, in body size. The mutation causes the widespread 

30 expression of a protein which is normally seen only in 
neonatal skin (Michaud, E.J. et al . , 1994, Genes Devel. 
8 :1463-1472) . 

Other animal models include fa/fa ( fatty ) rats, which 
bear many similarities to the ob/ob and db/db mice, discussed 
35 above. One difference is that, while fa/fa rats are very 
sensitive to cold, their capacity for non-shivering 
thermogenesis is normal. Torpor seems to play a larger part 
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in the maintenance of obesity in fa/fa rats than in the mice 
mutants. In addition, inbred mouse strains such as NZO mice 
and Japanese KK mice are moderately obese. Certain hybrid 
mice, such as the Wellesley mouse, become spontaneously fat. 
5 Further, several desert rodents, such as the spiny mouse, do 
not become obese in their natural habitats, but do become so 
when fed on standard laboratory feed. 

Animals which have been used as models for obesity have 
also been developed via physical or pharmacological methods. 

10 For example, bilateral lesions in the ventromedial 

hypothalamus (VMH) and ventrolateral hypothalamus (VLH) in 
the rat are associated, respectively, with hyperphagia and 
gross obesity and with aphagia, cachexia and anorexia. 
Further, it has been demonstrated that feeding monosodium- 

15 glutamate (MSG) or gold thioglucose to newborn mice also 
results in an obesity syndrome. 

In summary, therefore, obesity, which poses a major, 
worldwide health problem, represents a complex, highly 
heritable trait. Given the severity,, prevalence and 

20 potential heterogeneity of such disorders, there exists a 
great need for the identification of those genes that 
participate in the control of body weight. 

It is an objective of the invention to provide 
modulators, such as intracellular modulators, of body weight, 

25 to provide methods for diagnosis of body weight disorders, to 
provide therapy for such disorders and to provide an assay 
system for the screening of substances which can be used to 
control body weight. 

30 3. SUMMARY O F THE INVENTTOW 

The present invention relates to the identification of 
novel nucleic acid molecules and proteins encoded by such 
nucleic acid molecules or degenerate variants thereof, that 
participate in the control of mammalian body weight. The 
35 nucleic acid molecules of the present invention represent the 
genes corresponding to the mammalian tub gene, including the 
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human tub gene, which are involved in the regulation, control 
and/or modulation of body weight. 

In particular, the compositions of the present invention 
include nucleic acid molecules ( e.g. , tub gene) , including 
5 recombinant DNA molecules, cloned genes or degenerate 
variants thereof, especially naturally occurring variants, 
which encode novel tub gene products, and antibodies directed 
against such tub gene products or conserved variants or 
fragments thereof. The compositions of the present invention 

10 additionally include cloning vectors, including expression 
vectors, containing the nucleic acid molecules of the 
invention and hosts which have been transformed with such 
nucleic acid molecules. 

Nucleic acid sequences of a wild type and a mutant form 

15 of the murine tub gene are provided. The wild type murine 
tub gene produces a full length transcript of approximately 
7.0 kb and encodes a protein of 505 amino acids, the sequence 
of which is provided. The amino acid sequence of the 
predicted full length tufe gene product does not contain 

20 either a recognizable transmembrane domain or a signal 
sequence, suggesting that the tub gene product is an 
intracellular gene product. The mammalian tub gene is, as 
shown herein, expressed in the brain, including the 
hypothalamus . 

25 Nucleic acid sequences of a wild-type human tub gene are 

also provided. The human tub gene encodes a full length 
protein of 506 amino acids, the sequence of which is 
provided. The human tub gene and gene product are strikingly 
similar to the murine tub gene and gene product. 

30 Specifically, the human tub gene is, at the nucleotide level, 
89% identical to the murine tufe gene. Further, the amino 
acid sequence of the human tub gene product is 94% identical 
to the amino acid sequence of the murine tub gene product . 
Both murine and human tub genes produce transcripts 

35 which undergo alternative splicing. Such alternative 

splicing yields, in addition to the full length transcripts, 
transcripts which lack sequences corresponding to tub exon 5. 
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Nucleic acid sequences corresponding to such alternatively 
spliced transcripts and the tub gene products encoded by such 
alternatively spliced transcripts are provided herein. 

In addition, this invention presents methods for the 
5 diagnostic evaluation and prognosis of body weight disorders, 
including obesity, cachexia and anorexia, and for the 
identification of subjects having a predisposition to such 
conditions. For example, nucleic acid molecules of the 
invention can be used as diagnostic hybridization probes or 

10 as primers for diagnostic PGR analysis for the identification 
of tub gene mutations, allelic variations and regulatory 
defects in the tub gene, and of alternatively spliced 
transcripts produced by the tub gene. For example, human tub 
genomic sequences are provided which can be used to 

15 selectively amplify human tub exons for analysis. 

Further, methods and compositions are presented for the 
treatment of body weight disorders, including obesity, 
cachexia and anorexia. Such methods and compositions are 
capable of modulating the level of tub gene expression and/or 

20 the level of tub gene product activity. Such methods and 
compositions can also be utilized in the treatment or 
amelioration of symptoms of tub gene-related sensory defects 
( e.g. . eye and hearing) and fertility defects. 

Still further, the present invention relates to methods 

25 for. the use of the tub gene and/or tub • gene products for the 
identification of compounds which modulate tub gene 
expression and/or the activity of tub gene products. Such 
compounds can be used as agents to control body weight and, 
in particular, therapeutic agents in the treatment of body 

30 weight and body weight disorders, including obesity, cachexia 
and anorexia. Such methods and compositions can also be 
utilized in the treatment or amelioration of symptoms of tub 
gene-related sensory ( e.g. . eye and hearing) and fertility 
defects. It is further contemplated that the nucleic acid 

35 molecules, peptides and other compounds of the invention can 
have agricultural applications. For example, the ratio of 
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fat to lean tissue of agricultural animals can be favorably 
altered, e.g. , this ratio can be decreased. 

This invention is based, in part, on the genetic and 
physical mapping of the tub gene to a specific portion of 
5 mouse chromosome 7, described in the Examples presented, 
below, in Section 6 and 7. The invention is further based, 
in part, on the expression and sequence analysis of a 
candidate tub gene using nucleic acid derived from wild type 
and tub homozygous animals, which proves that this candidate 

10 gene does, indeed, represent the tub gene. Such analyses are 
described in the Examples presented, below, in Sections 8-12, 
and include the identification of a splice site mutation in 
nucleic acid derived from tub animals which is absent from 
the corresponding nucleic acid derived from wild type, non- 

15 obese animals. This single base mutation consists of a 

guanine (G) to a thymidine (T) in the splice site recognition 
sequence, which results in the retention of an intronic 
sequence in the mature £ub mRNA that encodes an abnormal, 
loss-of -function, £ufe gene product. Further, Section 13 

20 presents the successful cloning of the human tub gene 
homologue . 

Still further, the Example presented in Section 14 
demonstrates that both the murine and human tub transcripts 
undergo alternative splicing. Section 15 demonstrates the 
25 successful expression of recombinant human and murine tub 
gene products. Finally, the Example presented in Section 16 
describes the identification, cloning and characterization of 
a human tub homolog. 

30 4. DESCRIPTION OF THE FIGURES 

FIG- 1. Physical map of the D7Mitl7 to D7Mit53 
interval of mouse chromosome 7. 

FIG. 2. Northern blot analysis of total RNA derived from 
various tissues of tub and wild type {C57BL/6J) mice, using 
35 the 90 bp P8X1 DNA fragment as a probe. See Sections 10.1 
and 10.2 for details. 
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FIG. 3. Northern blot analysis of total RNA derived 
from various tissues of tiib and wild type (C57BL/6J) mice, 
using the 1.15 kb fume009 cDNA clone as a probe. See 
Sections 9.1 and 9.2 for details. 
5 FIG. 4. Southern blot analysis of EcoRI -digested 

mammalian genomic DNA derived from a number of different 
species, as indicated, using a fragment of CBT9 (P8X9-10) as 
a probe, as described, below, in Sections 10.1 and 10.2. 

FIG. 5. In situ hybridization analysis of CBT9 spati..: 
10 expression in a brain (hypothalamus) tissue section of 
•C57BL/6J wild type mice, using a fume009 cDNA probe. 

FIG. 6. Nucleotide sequence of the coding region (and 
portions of 5' and 3' untranslated regions) of the wild type 
£ilb gene (bottom line) and the encoded amino acid sequence 
15 (top line) . 

FIG. 7. Alignment of cDNA and genomic sequences 
derived from wild type C57BL/6J and £ub RNA in the region of 
the splice site mutation. See Section 12.1 and 12.2 for 
details. 

20 FIG. 8. Schematic representation of the splicing 

defect in the CBT9 gene in tub animals. 

FIG. 9. Nucleotide sequence of the coding region (and 
portions of 5' and 3' untranslated regions) of the human tub 
gene (bottom line) and the encoded human tub gene product 

25 amino acid sequence (top line) . 

FIG. lOA-lOE. Human tub genomic sequence'. Depicted 
herein are human tub gene exons 4-12 nucleotide sequences and 
flanking intronic sequences. Intron boundaries are depicted 
in bold; exon sequences are underlined. lOA. Exon 4 

30 (corresponding to nucleotide sequence 254-397 of FIG. 9) and 
its flanking genomic sequence. lOB. Exon 5 (corresponding 
to nucleotide sequence 398-565 of FIG. 9) and its flanking 
genomic sequence. IOC. Exons 6-8 (corresponding to 
nucleotide sequences 566-687, 688-885, and 886-998 of FIG. 9, 

35 respectively) and its flanking genomic sequence. lOD. Exon 
9 (corresponding to nucleotide sequence 999-1116 of FIG. 9) 
and its flanking genomic sequence. lOE. Exons 10-12 
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(corresponding to nucleotide sequences 1117-1215, 1216-1387 
and 1388-1729 of FIG. 9, respectively) and its flanking 
genomic sequence. 

FIG. 11. SDS polyacrylamide protein gel demonstrating 
5 bacterial expression of recombinant murine and human tub gene 
products. Lanes from left to right: Pharmacia Low Molecular 
Weight Markers; uninduced BL21DE3 /human pET29*- tub : induced 
BL21DE3 /human pET29*-tiib; induced BL21DE3/ human pET29*-tub 
HlSg; induced BL21DE3 /murine pET29»- tub ; induced 
10 BL21DE3 /murine pET29*-£ufe HIS«. Arrow represents recombinant 
tub gene products. 

FIG. 12. Nucleotide and amino acid sequence of the 
human tub homolog l gene. Top line: amino acid sequence. 
Bottom line: nucleotide sequence. represents the stop 

15 codon. 

5. DETAILED DESCRIPTION OF THE TNVRMTTnN 
Described herein are the identification of the novel 
mammalian tubbv (tub) genes, including the human tub gene, 

20 which are involved in the control of mammalian body weight. 
Also described are recombinant mammalian, including human, 
tub DNA molecules, cloned genes, or degenerate variants 
thereof. The compositions of the present invention further 
include tub gene products ( e.g. . proteins) that are encoded 

25 by the tub gene, and the modulation of tub gene expression 
and/or tub gene product activity in the treatment of 
mammalian body weight, and body weight disorders, including 
obesity, cachexia and anorexia. Also described herein are 
antibodies against tub gene products ( e.g. . proteins) , or 

30 conserved variants or fragments thereof, and nucleic acid 
probes useful for the identification of fiub gene mutations 
and the use of such nucleic acid probes in diagnosing 
mammalian body weight disorders, including obesity, cachexia 
and anorexia. Further described are methods for the use of 

35 the tub gene and/or tub gene products in the identification 
of compounds which modulate the activity of the tub gene 
product . 
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The murine tub nucleic acid compositions of the 
invention are demonstrated in the Examples presented, below, 
in Sections 6 through 12, The human tub nucleic acid 
compositions of the invention are demonstrated in Section 13, 
5 below. For clarity, it should be noted that the murine tub 
gene is also referred to herein as the CBT9 gene, and was 
identified and cloned as follows. Genetic and physical 
mapping of the murine tub gene interval was narrowed to the 
interval between markers D7Mit39 and D7Mit53 . A PI genomic 

10 clone, P8, was located within this interval, as indicated in 
FIG. 1. A P8 subclone, designated ium008p004, was sequenced • 
An analysis of ium008p004 indicated that this sequence was 
part of the coding region of a gene. A 90 bp fragment, 
designated P8X1, was amplified from this ium008p004 subclone. 

15 P8X1 was used as a probe to screen a mouse brain cDNA 

library, resulting in the identification of a 1.15 kb cDNA 
clone, designated fumeOOS. fume009 was used as a probe to 
screen a mouse * hypothalamus cDNA library, resulting in the 
identification of a 6.0 kb cDNA clone, designated fumh019. 

20 To summarize, therefore, ium008p004, PX81, fume009 and 

fumh019 are all part of the murine tub gene, which is also 
referred to herein as the CBT9 gene, 

5.1. THE tub GENE 

25 The murine tub gene, shown in FIG. 6, and the human tub 

gene, shown in FIG, 9, are novel genes involved in the 
control of body weight. Nucleic acid sequences of the 
identified tub gene are described herein. As used herein, 
"tub gene" refers to (a) a gene containing the DNA sequence 

30 shown in FIG. 6 or FIG. 9 or contained in the cDNA clone 
fumh019, CBT9H1 or CBT9H3, or genomic clone P6, P8, or B13, 
as deposited with the American Type Culture Collection 
(ATCC) ; (b) any DNA sequence that encodes the amino acid 
sequence shown in FIG. 6 or FIG. 9-, or encodes the amino acid 

35 sequence shown in FIG. 6 or FIG. 9 but lacking the amino acid 
residues encoded by tub exon 5 ( i.e. , amino acid residues 
134-189 to 134-189 of FIG. 6 or FIG. 9), or encoded by the 
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cDNA clone fumh019, CBT9H1 or CBT9H3, or genomic clone P6 , 
P8, or B13, as deposited with the ATCC; (c) any DNA sequence 
that hybridizes to the complement of the DNA sequences that 
encode the amino acid sequence shovm in FIG. 6 or FIG. 9, or 
5 encodes the amino acid sequence shown in FIG. 6 or FIG. 9 but 
lacking the amino acid residues encoded by tub exon 5 ( i.e. . 
amino acid residues 134 to 189 of FIG. 6 or FIG. 9), or 
contained in the cDNA clone fumh019, CBT9H1 or CBT9H3, or 
genomic clone P6, P8, or B13, as deposited with the ATCC, 

10 under highly stringent conditions, e.g. . hybridization to 
filter-bound DNA in 0 . 5 M NaHPO^, 7% sodium dodecyl sulfate 
(SDS) , 1 im EDTA at SS^C, and washing in O.lxSSC/O.1% SDS at 
68°C (Ausubel F.M. et al . , eds . , 1989, Current Protocols in 
Molecular Biology, Vol. I, Green Publishing Associates, Inc., 

15 and John Wiley & sons, Inc., New York, at p. 2.10.3) and 

encodes a gene product functionally equivalent to a tub gene 
product encoded by sequences contained within the cDNA clone 
fumh019, CBT9H1 or CBT9H3 , sequences shown in FIG. 6 or FIG. 
9, sequences shown in FIG, 6 or FIG. 9, but lacking tub exon 

20 5, or genomic clone P6, P8, or B13; and/or (d) any DNA 
sequence that hybridizes to the complement of the DNA 
sequences that encode the amino acid sequence shown in FIG. 6 
or FIG. 9, or encode the amino acid sequence shown in FIG. 6 
or FIG. 9 but lacking the amino acid residues encoded by tub 

25 exon 5 (i.e.. amino acid residues 134 to 189 of FIG. 6 or 
FIG. 9) , contained in the cDNA clone fumh019, CBT9H1 or 
CBT9H3, or genomic clone P6, P8, or B13, as deposited with 
the ATCC, under less stringent conditions, such as moderately 
stringent conditions, e.g. . washing in 0.2xSSC/0.1% SDS at 

30 42»C (Ausubel et al . , 1989, supra), yet which still encodes a 
functionally equivalent £iab gene product. As used herein, 
tiib gene may also refer to degenerate variants of DNA 
sequences (a) through (d) , especially naturally occurring 
variants thereof. 

35 The invention also includes nucleic acid molecules, 

preferably DNA molecules, that hybridize to, and are 
therefore the complements of, the DNA sequences (a) through 

- 13 - 



wo 97/02048 



PCTAJS96/ni86 



(d) , in the preceding paragraph. Such hybridization 
conditions may be highly stringent or less highly stringent, 
as described above. In instances wherein the nucleic acid 
molecules are deoxyoligonucleotides ("oligos"), highly 
5 stringent conditions may refer, e.g. . to washing in 
6xSSC/0,05% sodium pyrophosphate at 37®C {for 14-base 
oligos) , 48«>C (for 17-base oligos) , 55^0 (for 20-base 
oligos) , and 60°C (for 23-base oligos) . These nucleic acid 
molecules may encode or act as tub gene antisense molecules, 

10 useful, for example, in tub gene regulation (for and/or as 
antisense primers in amplification reactions of tub gene 
nucleic acid sequences. With respect to tub gene regulation, 
such techniques can be used to regulate, for example, 
cachexia and/or anorexia. Further, such sequences may be 

15 used as part of ribozyme and/or triple helix sequences, also 
useful for tub gene regulation. Still further, such 
molecules may be used as components of diagnostic methods 
whereby, for example, the presence of a particular tub allele 
or alternatively spliced tub transcript responsible for 

20 causing or predisposing one to a weight disorder, such as 
obesity, may be detected. Among the molecules which can be 
used for diagnostic methods such as these which involve 
amplification of genomic tub sequences are those listed in 
FIG. 10 and in Table I, below. 

25 The invention also encompasses (a) DNA vectors that 

contain any of the foregoing tub coding sequences and/or 
their complements ( i.e. , antisense) ; (b) DNA expression 
vectors that contain any of the foregoing tub coding 
sequences operatively associated with a regulatory element 

30 that directs the expression of the coding sequences; and (c) 
genetically engineered host cells that contain any of the 
foregoing tub coding sequences operatively associated with a 
regulatory element that directs the expression of the coding 
sequences in the host cell. As used herein, regulatory 

35 elements include but are not limited to inducible and non- 
inducible promoters, enhancers, operators and other elements 
known to those skilled in the art that drive and regulate 
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expression. Such regulatory elements include but are not 
limited to the cytomegalovirus hCMV immediate early gene, the 
early or late promoters of SV40 adenovirus, the lac system, 
the t£E system, the JAC system, the TRC system, the major 
5 operator and promoter regions of phage A, the control regions 
of fd coat protein, the promoter for 3 -phosphoglycerate 
kinase, the promoters of acid phosphatase, and the promoters 
of the yeast of-mating factors. The invention includes 
fragments of any of the DNA sequences disclosed herein. 

10 In addition to the tub gene sequences described above, 

homologs of such sequences, exhibiting extensive homology to 
one or more of domains of the tub gene product present in 
other species can be identified and readily isolated, without 
undue experimentation, by molecular biological techniques 

15 well knovm in the art. Further, there can exist homolog 
genes at other genetic loci within the genome that encode 
proteins which have extensive homology to one or more domains 
of the tub gene product. These genes can also be identified 
via similar techniques. 

20 As an example, in order to clone a human tub gene 

homologue using isolated murine tub gene sequences as 
disclosed herein, such murine tub gene sequences may be 
labeled and used to screen a cDNA library constructed from 
mRNA obtained from appropriate- cells or tissues ( e.g. . 

25 preferably hypothalamus, or brain) derived from the organism 
(in this case, human) of interest. With respect to the 
cloning of such a human tub homologue, a human fetal brain 
cDNA library ( e.g. . Clontech #HL1149x) may, for example, be 
used for screening. 

30 The hybridization washing conditions used should be of a 

lower stringency when the cDNA library is derived from an 
organism different from the type of organism from which the 
labeled sequence was derived. With respect to the cloning of 
a human tub homologue, for example, hybridization can be 

35 performed for 4 hours at 65° C using Amersham Rapid Hyb'** 

buffer (Cat. #RPN1639) according to manufacturer's protocol, 
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followed by washing, with a final washing stringency of 
l.OxSSC/0.1% SDS at 50^ C for 20 minutes being preferred. 

Low stringency conditions are well known to those of 
skill in the art, and will vary predictably depending on the 
5 specific organisms from which the library and the labeled 
sequences are derived. For guidance regarding such 
conditions see, for example, Sambrook et al . , 1989, Molecular 
Cloning, A Laboratory Manual, Cold Springs Harbor Press, 
N.Y.; and Ausubel et al., 1989, Current Protocols in 

10 Molecular Biology, Green Publishing Associates and Wiley 
Interscience , N . Y . 

Alternatively, the labeled fragment may be used to 
screen a genomic library derived from the organism of 
interest, again, using appropriately stringent conditions, 

15 Further, a tub gene homologue may be isolated from 

nucleic acid of the organism of interest by performing PGR 
using two degenerate oligonucleotide primer pools designed on 
the basis of amino acid sequences within the tub gene product 
disclosed herein. The template for the reaction may be cDNA 

20 obtained by reverse transcription of mRNA prepared from, for 
example, human or non-human cell lines or tissue known or 
suspected to express a tub gene allele. 

The PCR product may be subcloned and sequenced to ensure 
that the amplified sequences represent the sequences of a tub 

25 gene nucleic acid sequence. The PCR fragment may then be 
used to isolate a full length cDNA clone by a variety of 
methods. For example, the amplified fragment may be labeled 
and used to screen a cDNA library, such as a bacteriophage 
cDNA library. Alternatively, the labeled fragment may be 

30 used to isolate genomic clones via the screening of a genomic 
library. 

Taking, as an example, the cloning of a human tub 
homologue using murine tub nucleic acid sequences, among the 
murine tub primers which may be utilized for PCR 
35 amplification are, for example, the following, which are 
derived from the murine fumh019 sequence described, above: 
5'- CCG ACT CGA TTG CCA GTG TA -3' 
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5' - GCG GAT ACA GAC TCT CTC AT -3' 
These primers generate a cDNA product of approximately 950 
base pairs which can then be used as probe for the screening 
of appropriate cDNA libraries such as, for example, human 
5 fetal brain cDNA libraries ( e.g. , Clontech #HL1149x) . When a 
cDNA library is screened with probes such as this, 
hybridization can, for example, be performed for 4 hours at 
65° C using Amersham Rapid Hyb'*' buffer (Cat. #RPN1639) 
according to manufacturer's protocol, followed by washing, 

10 with a final washing stringency of l.Ox SSC/0,1% SDS at 50*^ C 
for 20 minutes being preferred. 

The Example presented in Section 16, below, describes 
the successful identification, cloning and characterization 
of a human tub homolog. 

15 PCR technology may also be utilized to isolate full 

length cDNA sequences. For example, RNA may be isolated, 
following standard procedures, from an appropriate cellular 
or tissue source ( i.e. . one known, or suspected, to express 
the tub gene, such as, for example, hypothalamus tissue) . A 

20 reverse transcription reaction may be performed on the RNA 
using an oligonucleotide primer specific for the most 5' end 
of the amplified fragment for the priming of first strand 
synthesis. The resulting RNA/DNA hybrid may then be "tailed" 
with guanines using a standard terminal transferase reaction, 

25 the hybrid may be digested with RNAase H, and second strand 
synthesis may then be primed with a poly-C primer. Thus, 
cDNA sequences upstream of the amplified fragment may easily 
be isolated. For a review of cloning strategies which may be 
used, see e.g. . Sambrook et al., 1989, supra. 

30 tub gene sequences may additionally be used to isolate 

mutant tub gene alleles. Such mutant alleles may be isolated 
from individuals either known or proposed to have a genotype 
which contributes to the symptoms of body weight disorders 
such as obesity, cachexia or anorexia. Mutant alleles and 

35 mutant allele products may then be utilized in the 
therapeutic and diagnostic systems described below. 
Additionally, such tub gene sequences can be used to detect 
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tub gene regulatory (e.g. , promoter) defects which can affect 
body weight . 

A cDNA of a mutant tub gene may be isolated, for 
example, by using PCR, a technique which is well known to 
5 those of skill in the art. In this case, the first cDNA 
strand may be synthesized by hybridizing an oligo-dT 
oligonucleotide to mRNA isolated from tissue known or 
suspected to be expressed in an individual putatively 
carrying the mutant tub allele, and by extending the new 

10 strand with reverse transcriptase. The second strand of the 
cDNA is then synthesized using an oligonucleotide that 
hybridizes specifically to the 5' end of the normal gene. 
Using these two primers, the product is then amplified via 
PGR, cloned into a suitable vector, and subjected to DNA 

15 sequence analysis through methods well known to those of 
skill in the art. By comparing the DNA sequence of the 
mutant £ub allele to that of the normal tixb allele, the 
mutation (s) responsible for the loss or alteration of 
function of the mutant tub gene product can be ascertained. 

20 Alternatively, a genomic library can be constructed 

using DNA obtained from an individual suspected of or known 
to carry the mutant tub allele, or a cDNA library can be 
constructed using RNA from a tissue known, or suspected, to 
express the mutant tub allele. The normal tub gene or any 

25 suitable fragment thereof may then be labeled and used as a 
probe to identify the corresponding mutant tub allele in such 
libraries. Clones containing the mutant tub gene sequences 
may then be purified and subjected to sequence analysis 
according to methods well known to those of skill in the art. 

30 Additionally, an expression library can be constructed 

utilizing cDNA synthesized from, for example, RNA isolated 
from a tissue known, or suspected, to express a mutant tub 
allele in an individual suspected of or known to carry such a 
mutant allele. In this manner, gene products made by the 

35 putatively mutant tissue may be expressed and screened using 
standard antibody screening techniques in conjunction with 
antibodies raised against the normal tub gene product, as 
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described, below, in Section 5.3. (For screening techniques, 
see, for example, Harlow, £• and Lane, eds., 1988, 
"Antibodies: A Laboratory Manual", Cold Spring Harbor Press, 
Cold Spring Harbor.) In cases where a tub mutation results 
5 in an expressed gene product with altered function ( e.g. , as 
a result of a missense or a f rameshif t mutation) , a 
polyclonal set of ant i- tub gene product antibodies are likely 
to cross-react with the mutant tub gene product- Library 
clones detected via their reaction with such labeled 
10 antibodies can be purified and subjected to sequence analysis 
according to methods well known to those of skill in the art. 

5 . 2 . PROTEIN PRODUCTS OF THE tub GENE 
tub gene products, or peptide fragments thereof, can be 

15 prepared for a variety of uses. For example, such gene 

products, or peptide fragments thereof, can be used for the 
generation of antibodies, in diacfnostic assays, or for the 
identification of other cellular gene products involved in 
the regulation of body weight. 

20 The amino acid sequence depicted in FIG. 6 represents a 

murine tub gene product, while the amino acid sequence 
depicted in FIG. 9 represents a human tub gene product. The 
tub gene product, sometimes referred to herein as a " tub 
protein" , may additionally include those gene products 

25 encoded by the tub gene sequences described in Section 5.1, 
above, and is intended to include, for example, a tub gene 
product encoded by a tub gene sequence lacking tub exon 5. 

In addition, tub gene products may include proteins that 
represent functionally equivalent gene products. Such an 

30 equivalent tub gene product may contain deletions, additions 
or substitutions of amino acid residues within the amino acid 
sequence encoded by the tub gene secjuences described, above, 
in Section 5.1, but which result in a silent change, thus 
producing a functionally equivalent, tub gene piroduct . Amino 

35 acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity , hydrophilicity , 
and/or the amphipathic nature of the residues involved. For 
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example, nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, 
5 and glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged 
(acidic) amino acids include aspartic acid and glutamic acid. 

"Functionally equivalent", as utilized herein, refers to 
a protein capable of exhibiting a substantially similar i^ 

10 vivo activity as the endogenous tub gene products encoded by 
the tub gene sequences described in Section 5.1, above. The 
in vivo activity of the tub gene product, as used herein, 
refers to the ability of the tub gene product, when present 
in an appropriate cell type, to ameliorate, prevent or delay 

15 the appearance of the obese phenotype relative to it 

appearance when that cell type lacks a functional £ab gene 
product. "Obese phenotype", as used herein, refers to the 
well known tub phenotype, db phenotype, or afe phenotype. In 
humans, this can also refer to an increased percentage of 

20 body fat which is medically considered abnormal. 

The tub gene products or peptide fragments thereof, may 
be produced by recombinant DNA technology using techniques 
well known in the art. Thus, methods for preparing the tub 
gene polypeptides and peptides -of the invention by expressing 

25 nucleic acid containing tub gene sequences are described 

herein. Methods which are well known to those skilled in the 
art can be used to construct expression vectors containing 
tub gene product coding sequences and appropriate 
transcriptional and translational control signals. These 

30 methods include, for example, in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic 
recombination. See, for example, the techniques described in 
Sambrook et al., 1989, supra, and Ausubel et al., 1989, 
supra. Alternatively, RNA capable, of encoding tub gene 

35 product sequences may be chemically synthesized using, for 
example, synthesizers. See, for example, the techniques 
described in "Oligonucleotide Synthesis", 1984, Gait, M.J. 
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ed., IRL Press, Oxford, which is incorporated by reference 
herein in its entirety - 

A variety of host -expression vector systems may be 
utilized to^ express the tub gene coding sequences of the 
5 invention. Such host -expression systems represent vehicles 
by which the coding sequences of interest may be produced and 
subsequently purified, but also represent cells which may, 
when transformed or transfected with the appropriate 
nucleotide coding sequences, exhibit the tub gene product of 

10 the invention in situ.. These include but are not limited to 
microorganisms such as bacteria ( e>a. , E. coli, B. suJbtilis) 
transformed with recombinant bacteriophage DNA, plasmid DNA 
or cosmid DNA expression vectors containing tub gene product 
coding sequences; yeast ( e.g. . Saccharomyces , Pichia) 

15 transformed with recombinant yeast expression vectors 

containing the tub gene product coding sequences; insect cell 
systems infected with recombinant virus expression vectors 
( e.g. . baculovirus) containing the tub gene product coding 
sequences; plant cell systems infected with recombinant virus 

20 expression vectors ( e.g. . cauliflower mosaic virus, CaMV; 
tobacco mosaic virus, TMV) or transformed with recombinant 
plasmid expression vectors ( e.g. . Ti plasmid) containing tub 
gene product coding sequences; or mammalian cell systems 
( e.g. . COS, CHO, BHK, 293, 3T3) harboring recombinant 

25 expression constructs containing promoters derived from the 
genome of mammalian cells ( e.g. , metallothionein promoter) or 
from mammalian viruses ( e.g. . the adenovirus late promoter; 
the vaccinia virus 7.5K promoter) . 

In bacterial systems, a number of expression vectors may 

30 be advantageously selected depending upon the use intended 
for the tub gene product being expressed. For example, when 
a large quantity of such a protein is to be produced, for the 
generation of pharmaceutical compositions of tub protein or 
for raising antibodies to tub protein, for example, vectors 

35 which direct the expression of high levels of fusion protein 
products that are readily purified may be desirable. Such 
vectors include, but aire not limited, to the E. coli 
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expression vector pUR278 (Ruther et al., 1983, EMBO J. 
2:1791), in which the £ufe gene product coding sequence may be 
ligated individually into the vector in frame with the lac Z 
coding region so that a fusion protein is produced; pIN 
5 vectors (Inouye & Inouye, 1985, Nucleic Acids Res. 13:3101- 
3109/ Van Heeke & Schuster, 1989, J. Biol. Chem, 264:5503- 
5509); and the like. pGEX vectors may also be used to 
express foreign polypeptides as fusion proteins with gluta- 
thione S-transferase (GST). In general, such fusion proteins 

10 are soluble and can easily be purified from lysed cells by 
adsorption to glutathione -agarose beads followed by elution 
in the presence of free glutathione. The pGEX vectors are 
designed to include thrombin or factor Xa protease cleavage 
sites so that the cloned target gene product can be released 

15 from the GST moiety. The Example presented in Section 15, 
below, describes the successful expression of both murine and 
human recombinant tub gene products utilizing modified pET 
vectors (Novagen, Inc., Madison WI) . 

In an insect system, Autoaraoha californica nuclear 

20 polyhedrosis virus (AcNPV) is used as a vector to express 
foreign genes. The virus grows in Spodootera frugiperda 
cells. The tub gene coding sequence may be cloned 
individually into non-essential regions (for example the 
polyhedrin gene) of the virus and placed under control of an 

25 AcNPV promoter (for example the polyhedrin promoter) . 

Successful insertion of tub gene coding sequence will result 
in inactivation of the polyhedrin gene and production of non- 
occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 

30 recombinant viruses are then. used to infect Spodoptera 
frugiperda cells in which the inserted gene is expressed. 
(E*<?- * see Smith et al., 1983, J. Virol. 46: 584; Smith, U.S. 
Patent No. 4,215,051) • 

In mammalian host cells, a number of viral -based 

35 expression systems may be utilized. In cases where an 
adenovirus is used as an expression vector, the tub gene 
coding sequence of interest may be ligated to an adenovirus 



- 22 - 



wo 97/02048 



PCT/US96/11186 



transcription/translation control complex, e.g. . the late 
promoter and tripartite leader sequence. This chimeric gene 
may then be inserted in the adenovirus genome by in vitro or 
in vivo recombination. Insertion in a non-essential region 
5 of the viral genome ( e.g. , region El or E3) will result in a 
recombinant virus that is viable amd capable of expressing 
tub gene product in infected hosts. ( E.g. , See Logan & 
Shenk, 1984, Proc . Natl. Acad, Sci. USA 81:3655-3659). 
Specific initiation signals may also be required for 
. 10 efficient translation of inserted tub gene product coding 
sequences. These signals include the ATG initiation codon 
and adjacent sequences. In cases where an entire tub gene, 
including its own initiation codon and adjacent sequences, is 
inserted into the appropriate expression vector, no 

15 additional translational control signals may be needed. 

However, in cases where only a portion of the tub gene coding 
sequence is inserted, exogenous translational control 
signals, including, perhaps, the ATG initiation codon, must 
be provided. Furthermore, the initiation codon must be in 

20 phase with the reading frame of the desired coding sequence 
to ensure translation of the entire insert. These exogenous 
translational control signals and initiation codons can be of 
a variety of origins, both natural and synthetic. The 
efficiency of expression may be enhanced by the inclusion of 

25 appropriate transcription enhancer elements, transcription 
terminators, etc. (see Bittner et al . , 1987, Methods in 
Enzymol . 153 : 516 - 544 ) . 

In addition, a host cell strain may be chosen which 
modulates the expression of the inserted sequences, or 

30 modifies and processes the gene product in the specific 
fashion desired. Such modifications ( e.g. . glycosylation) 
and processing ( e.g. . cleavage) of protein products may be 
important for the function of the protein. Different host 
cells have characteristic and specific mechanisms for the 

35 post -translational processing and modification of proteins 
and gene products. Appropriate cell lines or host systems 
can be chosen to ensure the correct modification and 
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processing of the foreign protein expressed. To this end, 
eukaryotic host cells which possess the cellular machinery 
for proper processing of the primary transcript, 
glycosylation, and phosphorylation of the gene product may b< 
5 used. Such mammalian host cells include but are not limited 
to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3 , WI38, and in 
particular, hypothalamic cell lines such as GN and GH-l cell 
lines . 

For long-term, high-yield production of recombinant 
10 proteins, stable expression is preferred. For example, cell 
lines which stably express the tiob gene product may be 
engineered. Rather than using expression vectors which 
contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 

15 control elements , promoter, enhancer, sequences, 

transcription terminators, polyadenylation sites, etc.), and 
a selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a 

20 selective media. The selectable marker in the recombinant 
plasmid confers resistance to the selection and allows cells 
to stably integrate the plasmid into their chromosomes and 
grow to form foci which in turn can be cloned and expanded 
into cell lines. This method may advantageously be used to 

25 engineer cell lines which express the tub gene product. Such 
engineered cell lines may be particularly useful in screening 
and evaluation of compounds that affect the endogenous 
activity of the tab gene product. 

The Example presented in Section 15, below, describes 

30 the successful expression of recombinant tab gene products in 
mammalian cell lines. 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
(Wigler, et al., 1977, Cell 11:223.), hypoxan thine -guanine 

35 phosphoribosyltransf erase (Szybalska & Szybalski, 1962, Proc. 
Natl. Acad. Sci . USA 48:2026), and adenine 

phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817) 
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processing of the foreign protein expressed. To this end, 
eukaryotic host cells which possess the cellular machinery 
for proper processing of the primary transcript, 
glycosylation, and phosphorylation of the gene product may be 
5 used. Such mammalian host cells include but are not limited 
to CHO, VERO, BHK, HeLa, COS, MDCK, 293, 3T3, WI38, and in 
particular, hypothalamic cell lines such as GN and GH-1 cell 
lines . 

For long-term, high-yield production of recombinant 
10 proteins, stable expression is preferred. For example, cell 
lines which stably express the tub gene product may be 
engineered- Rather than using expression vectors which 
contain viral origins of replication, host cells can be 
transformed with DNA controlled by appropriate expression 

15 control elements (e.g. . promoter, enhancer, sequences, 

transcription terminators, polyadenylation sites, etc.), and 
a selectable marker. Following the introduction of the 
foreign DNA, engineered cells may be allowed to grow for 1-2 
days in an enriched media, and then are switched to a 

20 selective media. The selectable marker in the recombinant 
plasmid confers resistance to the selection and allows cells 
to stably integrate the plasmid into their chromosomes and 
grow to form foci which in turn can be cloned and expanded 
into cell lines. This method may advantageously be used to 

25 engineer cell lines which express the tub gene product. Such 
engineered cell lines may be particularly useful in screening 
and evaluation of compounds that affect the endogenous 
activity of the tub gene product. 

The Example presented in Section 15, below, describes 

30 the successful expression of recombinant tab gene products in 
mammalian cell lines. 

A number of selection systems may be used, including but 
not limited to the herpes simplex virus thymidine kinase 
(Wigler, et al., 1977, Cell 11:223.), hypoxanthine -guanine 

35 phosphoribosyltransf erase (Szybalska & Szybalski, 1962, Proc. 
Natl. Acad. Sci. USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy, et al., 1980, Cell 22:817) 
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genes can be employed in tk", hgprt" or aprf cells, 
respectively. Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler, et al., 1980, 
5 Natl. Acad. Sci. USA 77:3567; O'Hare, et al., 1981, Proc, 
Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance 
to mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. 
Acad. Sci. USA 78:2072); neo, which confers resistance to the 
aminoglycoside G-418 (Colberre-Garapin, et al . , 1981, J. tlcl. 

10 Biol. 150:1); and hygro, which confers resistance to 
hygromycin (Santerre, et al., 1984, Gene 30:147). 

Alternatively, any fusion protein may be readily 
purified by utilizing an antibody specific for the fusion 
protein being expressed. For example, a system described by 

15 Janknecht et al. allows for the ready purification of non- 
denatured fusion proteins expressed in human cell lines 
(Janknecht, et al . , 1991, Proc. Natl. Acad. Sci. USA 88: 
8972-8976) . In this system, the gene of interest is 
subcloned into a vaccinia recombination plasmid such that the 

20 gene's open reading frame is translationally fused to an 
amino- terminal tag consisting of six histidine residues. 
Extracts from cells infected with recombinant vaccinia virus 
are loaded onto Ni^* • nitriloacetic acid-agarose columns and 
histidine-tagged proteins are selectively eluted with 

25 imidazole- containing buffers. The Example presented in 

Section 15, below, demonstrates the successful expression of 
carboxy- terminal histidine-tagged recombinant tub gene 
products • 

The tub gene products can also be expressed in 
30 transgenic animals. Animals of any species, including, but 
not limited to, mice, rats, rabbits, guinea pigs, pigs, 
micro-pigs, goats, and non-human primates, e.a; . baboons, 
monkeys, and chimpanzees may be used to generate tub 
transgenic animals. 
35 Any technique known in the art may be used to introduce 

the tub gene transgene into animals to produce the founder 
lines of transgenic animals. Such techniques include, but 
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are not limited to pronuclear microinjection (Hoppe, p.c. and 
Wagner, T.E., 1989, U.S. Pat. No. 4,873,191); retrovirus 
mediated gene transfer into germ lines (Van der Putten et 
al,, 1985, Proc. Natl. Acad. Sci., USA 82:6148-6152); gene 
5 targeting in embryonic stem cells (Thompson et al., 1989, 
Cell 56:313-321); electroporation of embryos (Lo, 1983, Mol 
Cell. Biol. 3:1803-1814); and sperm-mediated gene transfer 
(Lavitrano et al . , 1989, Cell 57:717-723); etc. For a review 
of such techniques, see Gordon, 1989, Transgenic Animals, 

10 Intl. Rev. Cytol. 115:171-229, which is incorporated by 
reference herein in its entirety. 

The present invention provides for transgenic animals 
that carry the Jmb transgene in all their cells, as well as 
animals which carry the transgene in some, but not all their 

15 cells, i.e. , mosaic animals. The transgene may be integrated 
as a single transgene or in concatamers, e.g. . head- to-head 
tandems or head-to- tail tandems. The transgene may also be 
selectively introduced into and activated in a particular 
cell type by following, for example, the teaching of Lasko et 

20 al. (Lasko, M. et al . , 1992, Proc. Natl. Acad. Sci. USA 89: 
6232-6236) . The regulatory sequences required for such a 
cell -type specific activation will depend upon the particular 
cell type of interest, and will be apparent to those of skill 
in the art. When it is desired that the tub gene transgene 

25 be integrated into the chromosomal site of the endogenous tub 
gene, gene targeting is preferred. Briefly, when such a 
technique is to be utilized, vectors containing some 
nucleotide sequences homologous to the endogenous tub gene 
are designed for the purpose of integrating, via homologous 

30 recombination with chromosomal sequences, into and disrupting 
the function of the nucleotide sequence of the endogenous tub 
gene. The transgene may also be selectively introduced into 
a particular cell type, thus inactivating the endogenous tub 
gene in only that cell type, by following, for example, the 

35 teaching of Gu et al . (Gu, et al., 1994, Science 265: 103- 
106) . The regulatory sequences required for such a cell -type 
specific inactivation will depend upon the particular cell 
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type of interest, and will be apparent to those of skill in 
the art . 

Once transgenic animals have been generated, the 
expression of the recombinant tub gene may be assayed 
5 utilizing standard techniques. Initial screening may be 
accomplished by Southern blot analysis or PGR techniques to 
analyze animal tissues to assay whether integration of the 
transgene has taken place. The level of mRNA expression of 
the transgene in the tissues of the transgenic animals may 

10 also be assessed using techniques which include but are not 
limited to Northern blot analysis of tissue samples obtained 
from the animal, in situ hybridization analysis, and RT-PCR. 
Samples of tub gene-expressing tissue, may also be evaluated 
immunocytochemically using antibodies specific for the tub 

15 transgene product, 

5.3. ANTIBODIES TO tub GENE PRODUCTS 
Described herein are methods for the production of 
antibodies capable of specifically recognizing one or more 

20 tub gene product epitopes or epitopes of conserved variants 
or peptide fragments of the tub gene products. 

Such antibodies may include, but are not limited to, 
polyclonal antibodies, monoclonal antibodies (mAbs) , 
humanized or chimeric antibodies, single chain antibodies, 

25 Fab fragments, F(ab')2 fragments, fragments produced by a Fab 
expression library, anti-idiotypic (anti-Id) antibodies, and 
epitope-binding fragments of any of the above. Such 
antibodies may be used, for example, in the detection of a 
tub gene product in an biological sample and may, therefore, 

30 be utilized as part of a diagnostic or prognostic technique 
whereby patients may be tested for abnormal levels of tub 
gene products, and/or for the presence of abnormal forms of 
the such gene products. Such antibodies may also be utilized 
in conjunction with, for example, .compound screening schemes, 

35 as described, below, in Section 5.4.2, for the evaluation of 
the effect of test compounds on tub gene product levels 
and/or activity. Additionally, such antibodies can be used 
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in conjunction with the gene therapy techniques described, 
below, in Section 5.4.3, to, for example, evaluate the normal 
and/or engineered £ub- expressing cells prior to their 
introduction into the patient. 
5 Anti-£iib gene product antibodies may additionally be 

used as a method for the inhibition of abnormal tub gene 
product activity. Thus, such antibodies may, therefore, be 
utilized as part of weight disorder treatment methods. 

For the production of antibodies against a tub gene 

10 product, various host animals may be immunized by injection 
with a tub gene product, or a portion thereof. Such host 
animals may include but are not limited to rabbits, mice, and 
rats, to name but a few. Various adjuvants may be used to 
increase the immunological response, depending on the host 

15 species, including but not limited to Freund's (complete and 
incomplete) , mineral gels such as aluminum hydroxide, surface 
active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, dinitrophenol , and potentially useful human 

20 adjuvants such as BCG (bacille Calmette-Guerin) and 
Corynebacterium parvum . 

Polyclonal antibodies are heterogeneous populations of 
antibody molecules derived from the sera of animals immunized 
with an antigen, such as a tub gene product, or an antigenic 

25 functional derivative thereof. For the production of 

polyclonal antibodies, host animals such as those described 
above, may be immunized by injection with tub gene product 
supplemented with adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations 

30 of antibodies to a particular antigen, may be obtained by any 
technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These 
include, but are not limited to, the hybridoma technique of 
Kohler and Milstein, (1975, Nature 256:495-497; and U.S. 

35 Patent No. 4,376,110), the human B-cell hybridoma technique 
(Kosbor et al . , 1983, Immunology Today 4:72; Cole, et al . , 
1983, Proc. Natl. Acad. Sci. USA 80:2026-2030), and the EBV- 
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hybridoma technique (Cole et al., 1985, Monoclonal Antibodies 
And Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such 
antibodies may be of any immunoglobulin class including IgG, 
IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma 
5 producing the mAb of this invention may be cultivated in 
vitro or in vivo. Production of high titers of mAbs in vivo 
makes this the presently preferred method of production. 

In addition, techniques developed for the production of 
"chimeric antibodies" (Morrison et al , , 1984, Proc. Natl. 

10 Acad, Sci., 81:6851-6855; Neuberger et al . , 1984, Nature, 
312:604-608; Takeda et al . , 1985, Nature, 314:452-454) by 
splicing the genes from a mouse antibody molecule of 
appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity 

15 can be used. A chimeric antibody is a molecule in which 

different portions are derived from different animal species, 
such as those having a variable region derived from a murine 
xttMo and a human immunoglobulin constant region. 

Alternatively, techniques described for the production 

20 of single chain antibodies (U.S. Patent 4,946,778; Bird, 

1988, Science 242:423-426; Huston et al . , 1988, Proc. Natl. 
Acad. Sci. USA 85:5879-5883; and Ward et al . , 1989, Nature 
334:544-54 6) can be adapted to produce single chain 
antibodies against tub gene products. Single chain 

25 antibodies are formed by linking the heavy and light chain 
fragments of the Fv region via an amino acid bridge, 
resulting in a single chain polypeptide. 

Antibody fragments which recognize specific epitopes may 
be generated by known techniques. For example, such 

30 fragments include but are not limited to: the FCabMj 

fragments which can be produced by pepsin digestion of the 
antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2 
fragments. Alternatively, Fab expression libraries may be 

35 constructed (Huse et al . , 1989, Science, 246:1275-1281) to 
allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. 
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5.4, USES OF THE tub GENE. GENE PRO DUCTS. AND ANTIBQDTRfi 
Described herein are various applications of the tub 
gene, the tub gene product including peptide fragments 
thereof, and of antibodies directed against the tub gene 
5 product and peptide fragments thereof. Such applications 
include, for example, prognostic and diagnostic evaluation of 
body weight disorders and the identification of subjects with 
a predisposition to such disorders, as described, below, in 
Section 5.4.1. Additionally, such applications include 

10 methods for the treatment of body. weight and body weight 

disorders, as described, below, in Section 5.4.2, and for the 
identification of compounds which modulate the expression of 
the tub gene and/or the activity of the tub gene product, as 
described below, in Section 5.4.3. Such compounds can 

15 include, for example, other cellular products which are 

involved in body weight regulation. These compounds can be 
used, for example, in the amelioration of body weight 
disorders including obesity, cachexia and anorexia. 

While, for clarity, uses related to body weight disorder 

20 abnormalities are primarily described in this Section, it is 
to be noted that each of the diagnostic and therapeutic 
treatments described herein can additionally be utilized in 
connection with sensory ( e.g. . eye and hearing) and fertility 
defects that are commonly associated with mutations in the 

25 tub gene. That is, the diagnostic and prognostic techniques 
described herein can also be utilized to diagnose tub related 
eye, hearing and fertility abnormalities and/ or a 
predisposition toward the development of such eye, hearing 
and fertility abnormalities, while the therapeutic techniques 

30 described herein can be utilized for the amelioration of such 
eye, hearing and fertility defects. 



35 
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5.4.1. DXAGNOSIS OF BODY WEIGHT DISORD ER ABNORMAT.TTTPQ . 
A variety of methods can be employed for the diagnostic 
and prognostic evaluation of body weight disorders, including 
obesity, caqhexia and anorexia, and for the identification of 
5 subjects having a predisposition to such disorders. 

Such methods may, for example, utilize reagents such as 
the tub gene nucleotide sequences described in Sections 5.1, 
and antibodies directed against tub gene products, including 
peptide fragments thereof, as described, above, in Section 

10 5.3. Specifically, such reagents may be used, for example, 
for: (1) the detection of the presence of tub gene mutations, 
or the detection of either over- or under-expression of tub 
gene mRNA relative to the non-body weight disorder state or 
the qualitative or quantitative detection of alternatively 

15 spliced forms of tub transcripts which may correlate with 
certain body weight disorders or susceptibility toward such 
body weight disorders; and (2) the detection of either an 
over- or an under -abundance of tub gene product relative to 
the non-body weight disorder state or the presence of a 

20 modified ( e.g. . less than full length) tub gene product which 
correlates with a body weight disorder state or a progression 
toward a body weight disorder state. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits comprising 

25 at least one specific tub gene nucleic acid or anti- tub gene 
antibody reagent described herein, which may be conveniently 
used, e.g. , in clinical settings, to screen and diagnose 
patients exhibiting body weight disorder abnormalities and to 
screen and identify those individuals exhibiting a 

30 predisposition to developing a body weight disorder 
abnormality. 

For the detection of tub mutations, any nucleated cell 
can be used as a starting source for genomic nucleic acid. 
For the detection of tub transcripts or tub gene products, 
35 any cell type or tissue in which the tub gene is expressed, 
such as, for example, hypothalamus cells, may be utilized. 
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Nucleic acid-based detection techniques are described^ 
below, in Section 5.4.1.1. Peptide detection techniques are 
described, below, in Section 5.4.1.2. 

5 5.4.1,1 DETECTION OF tub GENE NUCLEIC ACID MOLECULES 

Mutations or polymorhisms within the tub gene can be 
detected by utilizing a number of techniques. Nucleic acid 
from any nucleated cell can be used as the starting point for 
such assay techniques, and may be isolated according to 

10 standard nucleic acid preparation procedures which are well 
known to those of skill in the art. 

Genomic DNA may be used in hybridization or 
amplification assays of biological samples to detect 
abnormalities involving tub gene structure, including point 

15 mutations, insertions, deletions and chromosomal 

rearrangements. Such assays may include, but are not limited 
to, direct sequencing (Wong, C. et al., 1987, Nature 330 :384- 
386) , single stranded conformational polymorphism analyses 
(SSCP; Orita, M. et al . , 1989, Proc. Natl. Acad, Sci. USA 

20 81:2766-2770), heteroduplex analysis (Keen, T.J. et al., 
1991, Genomics 11:199-205; Perry, D.J. & Carrell, R.W., 
1992) , denaturing gradient gel electrophoresis (DGGE; Myers, 
R.M, et al., 1985, Nucl , Acids Res. 13:3131-3145), chemical 
mismatch cleavage (Cotton, R.G. et al , , 1988, Proc. Natl. 

25 Acad. Sci. USA 85:4397-4401) and oligonucleotide 

hybridization (Wallace, R.B. et al., 1981, Nucl. Acids Res. 
9:879-894; Lipshutz, R.J. et al . , 1995, Biotechniques 1^:442- 
447) . 

Diagnostic methods for the detection of tub gene 
30 specific nucleic acid molecules, in patient samples or other 
appropriate cell sources, may involve the amplification of 
specific gene sequences, e.g. . by the polymerase chain 
reaction (PGR; the experimental embodiment set forth in 
Mullis, K.B., 1987, U.S. Patent No.. 4,683,202), followed by 
35 the analysis of the amplified molecules using techniques well 
known to those of skill in the art, such as, for example, 
those listed above. Utilizing analysis techniques such as 
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these, the amplified sequences can be compared to those which 
would be expected if the nucleic acid being amplified 
contained only normal copies of the tub gene in order to 
determine whether a tub gene mutation exists. 
5 Among those tub nucleic acid sequences which are 

preferred for such amplification-related diagnostic screening 
analyses are oligonucleotide primers which amplify tub exon 
sequences- The sequence of such oligonucleotide primers are, 
therefore, preferably derived from tub intron sequence so 
. 10 that the enire exon (or coding region) can be analyzed, as 
discussed below. Primer pairs useful for amplification of 
tub exons are preferably derived from adjacent introns. For 
example, in order to amplify tub exon 5, a forward primer 
derived from the tub intron upstream of exon 5 ( i,e> . the 

15 intron between tub exon 4 and 5) could be used in conjunction 
with a reverse primer derived from the tub intron downstream 
of exon 5 ( i.e. . the intron between tub exon 5 and 6) . 

Appropriate primer pairs can be chosen such that each of 
the twelve tub exons are amplified. FIG, 10 depicts each of 

20 human tub exons 4 through 12 and, further, depicts intron 
sequences flanking each of these exons. Primers for the 
amplification of tub exons can routinely be designed by one 
of skill in the art by utilizing such intron flanking 
sequence . 

25 As an example, and not by way of limitation. Table I, 

below, lists primers and primer pairs which can be utilized 
for the amplification of each of human tub exons 2 through 
12. In this table, a primer pair is listed for each of exons 
2 through 12, which consists of a forward primer derived from 

30 intron secjuence upstream of the exon to be amplified and a 
reverse primer derived from sequence downstream of the exon 
to be amplified. Each of the primer pairs can be utilized, 
therefore, as part of a standard PCR reaction to amplify an 
individual tub exon. For each of ,the primer pairs listed in 

35 Table I, the approximate size of the resulting amplified 
exon- containing fragment is listed. Utilizing the primer 
pairs of Table I to amplify human tub exon 5, for example, 
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primers F5 (the forward primer) and R5 (the reverse primer) 
would be used to amplify a fragment of approximately 250 base 
pairs that would contain the entire coding region of exon 5. 



5 TABLE I 



IQ 



15 



20 



25 



HUMAN TQl EZON 


PRIMER NAME AMD SEQX7ENCE 


AMPLIFIED 
FRAGMENT SIZE 


2 


F2 5'-GTT CAA GCT GGT 
TTC AAG ATG-3' 

R2 5 ' -ATC ATC CAG GGA 
AGA TGG AC- 3' 


F2/R2=200bp 


3 


F3 5'-CTT CCT GGT GGA 
GGC AGT G-3' 

R3 S'-GAA GCA GTG ACG 
GGA TGT GG-3' 


F3/R3=220bp 


4 


F4 5 ' -GGG TAG CGA GCT 
CTG GTC-3' 

R4 S'-TCC AAG TCA GGA 
GGA CAA AC-3' 


F4/R4=295bp 


5 


F5 5 ' -GAA AGT GCA TCT 
GAG AAC CTG- 3' 

R5 5' -CCT CCT CCT GGA 
TGT AAC TC-3' 


F5/R5=250bp 


6 


F6 5' -TGT GAC CAT GTG 
TAT TTC AGG-3' 

R6 5' -CCT CTA ACG GAT 
GAG CAG TC-3' 


F6/R6=234bp 


7 


F7 5' -GAT TTG GAT CCC 
AGA CCA CC-3' 

R7 5' -GAC TTC CAG TCA 
CAT TTC AGC-3' 


F7/R7=331bp 


8 


F8 5' -GTG CAG ACC AGA 
GGC TGA G-3' 

R8 5' -TTC AGG CCC TCT 
ACA GAC AG- 3' 


F8/R8=300bp 
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10 



HDMAN TUB EXON 


PRIMER NAME AMD SEQUENCE 


AMPLIFIED 
FRAGMENT SIZE 


9 


F9 5'-TCA TAG GAC AGA 
CGA TGA GC-3' 

R9 5 ' -GTC CTG GAT TTC 
ATA TCT ACC-3' 


F9/R9=210bp 


10 


FIG 5'-AGG TAA ATA GAC 
GCC TCA GG-3' 

RIO 5'-ACG TCT GCC CTT 
AGA AGC TC-3' 


Fl0/R10=218bp 


11 


Fll 5' -CTG GAC CTG GCT 
CAG GTG-3' 

KXX D -VjIL All AGG GTT 
AGA AAG TTC C-3' 


Fll/Rll»400bp 


12 


F12 5' -TCT TCC CTC ATG 
TGG TTT GG-3' 

R12 5' -CCA CAG GCA GGC 
AGO CAA G-3' 


F12/R12=300bp 



15 



20 



25 



30 



35 



Additional tub nucleic acid sequences which are 
preferred for such amplification-related analyses are those 
which will detect the presence of the tub gene splice site 
mutation described, below, in Section 10.2 and depicted in 
FIG. 7. 

Further, well-known genotyping techniques can be 
performed to type polymorphi sms that are in close proxiniity 
to mutations in the tub gene itself. These polymorphisms can 
be used to identify individuals in families likely to carry 
mutations. If a polymorphism exhibits linkage disequilibrium 
with mutations in the tub gene, it can also be used to 
identify individuals in the general population likely to 
carry mutations. Polymorphisms that can be used in this way 
include restriction fragment length polymorphisms (RFLPs) , 
which involve sequence variations in restriction enzyme 
target sequences, single -base polymorphisms and simple 
sequence repeat polymorphisms (SSLPs) . 
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For example, Weber (U.S. Pat. No. 5,075,217, which is 
incorporated herein by reference in its entirety) describes a 
DNA marker based on length polymorphisms in blocks of (dC- 
dA)n- (dG-dT)n short tandem repeats. The average separation 
5 of (dC-dA)n- (dG-dT)n blocks is estimated to be 30,000-60,000 
bp. Markers which are so closely spaced exhibit a high 
frequency co- inheritance, and are extremely useful in the 
identification of genetic mutations, such as, for example, 
mutations within the tub gene, and the diagnosis of diseases 

10 and disorders related to tub mutations. 

Also, Caskey et al. (U.S. Pat. No. 5,364,759, which is 
incorporated herein by reference in its entirety) describe a 
DNA profiling assay for detecting short tri and tetra 
nucleotide repeat sequences. The process includes extracting 

15 the DNA of interest, such as the tub gene, amplifying the 
extracted DNA, and labelling the repeat sequences to form a 
genotypic map of the individual's DNA. 

A tub probe could additionally be used to directly 
identify RFLPs. Additionally, a tub probe or primers derived 

20 from the tub sequence could be used to isolate genomic clones 
such as YACs, BACs, PACs, cosmids, phage or plasmids. The 
DNA contained in these clones can be screened for single -base 
polymorphisms or simple sequence length polymorphisms (SSLPs) 
using standard hybridization or sequencing procedures. 

25 Alternative diagnostic methods for the detection of tub 

gene-specific mutations or polymorphisms can include 
hybridization techniques which involve for example, 
contacting and incubating nucleic acids including recombinant 
DNA molecules, cloned genes or degenerate variants thereof, 

30 obtained from a sample, e.g. . derived from a patient sample 
or other appropriate cellular source, with one or more 
labeled nucleic acid reagents including recombincuat DNA 
molecules, cloned genes or degenerate variants thereof, as 
described in Section 5.1, under conditions favorable for the 

35 specific annealing of these reagents to their complementary 
sequences within the tub gene. Preferably, the lengths of 
these nucleic acid reagents are at least 15 to 30 
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nucleotides. After incubation, all non-annealed nucleic 
acids are removed from the nucleic acid; tub molecule hybrid. 
The presence of nucleic acids which have hybridized, if any 
such molecules exist, is then detected. Using such a 
5 detection scheme, the nucleic acid from the cell type :r 
tissue of interest can be immobilized, for example, to a 
solid support such as a membrane, or a plastic surface such 
as that on a microtiter plate or polystyrene beads. In this 
case, after incubation, non-annealed, labeled nucleic acid 

10 reagents of the type described in Section 5.1 are easily 
removed. Detection of the remaining, annealed, labeled tub 
nucleic acid reagents is accomplished using standard 
techniques well-known to those in the art. The tub gene 
sequences to which the nucleic acid reagents have annealed 

15 can be compared to the annealing pattern expected from a 

normal tub gene sequence in order to determine whether a tub 
gene mutation is present. 

Among the tub nucleic acid sequences which are preferred 
for such hybridization analyses are those which will detect 

20 the presence of the tub gene splice site mutation described, 
below, in Section 10.2 and depicted in FIG. 7. 

Quantitative and qualitative aspects of tub gene 
expression can also be assayed. For example, RNA from a cell 
type or tissue known, or suspected, to express the tub gene, 

25 such as brain, especially hypothalamus cells, may be isolated 
and tested utilizing hybridization or PGR techniques such as 
are described, above. The isolated cells can be derived from 
cell culture or from a patient. The analysis of cells taken 
from culture may be a necessary step in the assessment of 

30 cells to be used as part of a cell-based gene therapy 

technique or, alternatively, to test the effect of compounds 
on the expression of the tub gene. Such analyses may reveal 
both quantitative and qualitative aspects of the expression 
pattern of the £ub gene, including activation or inactivation 

35 of tub gene expression and presence of alternatively spliced 
tub transcripts. 
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In one embodiment of such a detection scheme, a cDNA 
molecule is synthesized from an RNA molecule of interest 
(e.q> ; by reverse transcription of the RNA molecule into 
cDNA) . All or part of the resulting cDNA is then used as the 
5 template for a nucleic acid amplification reaction, such as a 
PCR amplification reaction, or the like. The nucleic acid 
reagents used as synthesis initiation reagents ( e.g. , 
primers) in the reverse transcription and nucleic acid 
amplification steps of this method are chosen from among the 
10 tub gene nucleic acid reagents described in Section 5.1. The 
preferred lengths of such nucleic acid reagents are at least 
9-30 nucleotides. 

For detection of the amplified product, the nucleic acid 
amplification may be performed using radioactively or non- 
15 radioactively labeled nucleotides. Alternatively, enough 
amplified product may be made such that the product may be 
visualized by standard ethidium bromide staining or by 
utilizing any other suitable nucleic acid staining method. 
Such RT-PCR techniques can be utilized to detect 
20 differences in tub transcript size which may be due to normal 
or abnormal alternative splicing. Additionally, such 
techniques can be performed using standard techniques to 
detect quantitative differences between levels of full length 
and/or alternatively spliced tub transcripts detected in 
25 normal individuals relative to those individuals exhibiting 
body weight disorders or exhibiting a predisposition to 
toward such body weight disorders. 

In the case where detection of specific alternatively 
spliced species is desired, appropriate primers and/or 
30 hybridization probes can be used. Using the detection of 
transcripts containing tub exon 5, for example, appropriate 
amplification primers can be chosen which will only yield an 
amplified fragment using cDNA derived from an exon 5- 
containing transcript . One of the. primer pairs can be 
35 designed, for example, to specifically utilize an exon 5 

sequence. In the absence of such sequence, no amplification 
would occur. Alternatively, primer pairs may be chosen 
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Utilizing the sequence data depicted in FIGS, 6 and 9 to 
choose primers which will yield fragments of differing size 
depending on whether exon 5 is present or absent from the 
transcript tub transcript being utilized. 
5 As an alternative to amplification techniques, standard 

Northern analyses can be performed if a sufficient quantity 
of the appropriate cells can be obtained. Utilizing such 
techniques, quantitative as well as size related differences 
between tub transcripts can also be detected, 

10 Additionally, it is possible to perform such tub gene 

expression assays "in situ", i ,e. . directly upon tissue 
se:cions (fixed and/or frozen) of patient tissue obtained 
from biopsies or resections, such that no nucleic acid 
purification is necessary. Nucleic acid reagents such as 

15 those described in Section 5.1 may be used as probes and/or 
primers for such in situ procedures (see, for example, Nuovo, 
G.J., 1992, "PGR In Situ Hybridization: Protocols And 
Applications", Raven Press, NY). 

20 5.4.1.2. DETECTION OF tub GENE PRODUCTS 

Antibodies directed against wild type or mutant tub gene 
products or conserved variants or peptide fragments thereof, 
which are discussed, above, in Section 5.3, may also be used 
as body weight disorder diagnostics and prognostics, as 

25 described herein. Such diagnostic methods, may be used to 
detect abnormalities in the level of tub gene expression, or 
abnormalities in the structure and/or temporal, tissue, 
cellular, or subcellular location of tub gene product. 
Because evidence disclosed herein indicates that the tub gene 

30 product is an intracellular gene product, the antibodies and 
immunoassay methods described below have important in vitro 
applications in assessing the efficacy of treatments for 
body-weight disorders such as obesity, cachexia and anorexia. 
Antibodies, or fragments of antibodies, such as those 

35 described below, may be used to screen potentially 

therapeutic compounds in vitro to determine their effects on 
tub gene expression and tub peptide production. The 
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compounds which have beneficial effects on body weight 
disorders, such as obesity, cachexia and anorexia, can be 
identified, and a therapeutically effective dose determined. 
In vitro immunoassays may also be used, for example, to 
5 assess the efficacy of cell-based gene therapy for body 
weight disorders, including obesity, cachexia and anorexia. 
Antibodies directed against tub peptides may be used in vitro 
to determine the level of tub gene expression achieved in 
cells genetically engineered to produce tub peptides. Given 

10 that evidence disclosed' herein Indicates that the tub gene 
product is an intracellular gene product, such an assessment 
is, preferably, done using cell lysates or extracts. Such 
analysis will allow for a determination of the number of 
transformed cells necessary to achieve therapeutic efficacy 

15 in vivo, as well as optimization of the gene replacement 
protocol . 

The tissue or cell type to be analyzed will generally 
include those which are known, or suspected, to express the 
£ilb gene, such as, for example, hypothalamic cells. The 

20 protein isolation methods employed herein may, for example, 
be such as those described in Harlow and Lane (Harlow, E, and 
Lane, D,, 1988, "Antibodies: A Laboratory Manual" , Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York) , which is incorporated herein by reference in its 

25 entirety. The isolated cells can be derived from cell 

culture or from a patient. The analysis of cell taken from 
culture may be a necessary step in the assessment of cells to 
be used as part of a cell -based gene therapy technique or, 
alternatively, to test the effect of compounds on the 

30 expression of the tub gene. 

Preferred diagnostic methods for the detection of tub 
gene products or conserved variants or peptide • fragments 
thereof, may involve, for example, immunoassays wherein the 
tub gene products or conserved variants, including gene 

35 products which are the result of alternatively spliced 
transcripts, or peptide fragments are detected by their 
interaction with an anti- tub gene product -specific antibody. 
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For exati^le, antibodies, or fragments of antibodies, 
such as those described, above, in Section 5*3, useful in. the 
present invention may be used to quantitatively or 
qualitatively detect the presence of tub gene products or 
5 conserved variants or peptide fragments thereof. This eatf-^ 
accomplished, for exanqple, by immunofluorescence techniques 
employing a f luorescently labeled antibody (see belov,^^ii8 
Section) coupled with light microscopic, flow cytometric; w 
fluorimetric detection. Such techniques are especially 
10 preferred if such tub gene products are expressed on the cell 
surface. 

The antibodies (or fragments thereof) useful in the 
present invention may, additionally, be employed 
histologically, as in immunofluorescence or immunoelectron 

15 microscopy, for in situ detection of tub gene products or 
conserved variants or peptide fragments thereof. In si^y 
detection may be accomplished by removing a histological 
specimen from a patient, and applying thereto a labeled 
antibody of the present invention. The antibody (or 

20 fragment) is preferably applied by overlaying the labeled 

antibody (or fragment) onto a biological sample- Through the 
use of such a procedure, it is possible to determine not only 
the presence of the tub gene product, or conserved variants 
or peptide fragments, but also its distribution in the 

25 examined tissue. Using the present invention, those of 
ordinary skill will readily perceive that any of a wide 
variety of histological methods (such as staining procedures) 
can be modified in order to achieve such in situ detection. 

imnrunoassays for tub gene products or conserved variants 

30 or peptide fragments thereof will typically comprise 

incubating a sample, such as a biological fluid, a tissue 
extract, freshly harvested cells, or lysates of cells which 
have been incubated in cell culture, in the presence of a 
detectably labeled antibody capable of identifying tub gene 

35 products or conserved variants or peptide fragments thereof, 
and detecting the bound antibody by any of a number of 
technicpies well-known in the art. 
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The biological sample may be brought in contact with an.d 
immobilized onto a solid phase support or carrier such as 
nitrocellulose, or other solid support which is capable of 
immobilizing cells, cell particles or soluble proteins. The 
5 support may then be washed with suitable buffers followed by 
treatment with the detectably labeled tub gene specific 
antibody. The solid phase support may then be washed with 
the buffer a second time to remove unbound antibody. The 
amount of bound label on solid support may then be detected 

10 by conventional means. 

By "solid phase support or carrier" is intended any 
support capable of binding an antigen or an antibody. Well- 
known supports or carriers include glass, polystyrene, 
polypropylene, polyethylene, dextran, nylon, amylases, 

15 natural and modified celluloses, polyacrylamides, gabbros, 
and magnetite. The nature of the carrier can be either 
soluble to some extent or insoluble for the purposes of the 
present invention. The support material may have virtually 
any possible structural configuration ■ so long as the coupled 

20 molecule is capable of binding to an antigen or antibody. 
Thus, the support configuration may be spherical, as in a 
bead, or cylindrical, as in the inside surface of a test 
tube, or the external surface of a rod. Alternatively, the 
surface may be flat such as a sheet, test strip, etc. 

25 Preferred supports include polystyrene beads. Those skilled 
in the art will know many other suitable carriers for binding 
antibody or antigen, or will be able to ascertain the same by 
use of routine experimentation. 

The binding activity of a given lot of ant i- tub gene 

30 product antibody may be determined according to well known 
methods. Those skilled in the art will be able to determine 
operative and optimal assay conditions for each determination 
by employing routine experimentation. 

One of the ways in which the ^ub gene peptide-specif ic 

35 antibody can be detectably labeled is by linking the same to 
an enzyme and use in an enzyme immunoassay (EIA) (Voller, A., 
"The Enzyme Linked Immunosorbent Assay (ELISA)", 1978, 
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Diagnostic Horizons 2:1-7, Microbiological Associates 
Quarterly Publication, Walkersville, MD) ; Voller, A. et al . , 
1978, J. Clin. Pathol. 31:507-520; Butler, J.E., 1981, Meth, 
Enzymol. 73:482-523; Maggio, E. (ed.), 1980, Enzyme 
5 Immunoassay, CRC Press, Boca Raton, PL,; Ishikawa, E. et al., 
(eds.), 1981, Enzyme Immunoassay, Kgaku Shoin, Tokyo). The 
enzyme which is bound to the antibody will react with an 
appropriate substrate, preferably a chromogenic sxibstrate, in 
such a manner as to produce a chemical moiety which can be 

10 detected, for example, by spectrophotometric, fluorimetric or 
by visual means. Enzymes which can be used to detectably 
label the antibody include, but are not limited to, malate 
dehydrogenase, staphylococcal nuclease, delta-5-steroid 
isomerase, yeast alcohol dehydrogenase, alpha- 

15 glycerophosphate, dehydrogenase, triose phosphate isomerase, 
horseradish peroxidase, alkaline phosphatase, asparaginase, 
glucose oxidase, beta-galactosidase, ribonuclease, urease, 
catalase, glucose -6 -phosphate dehydrogenase, glucoamylase and 
acetylcholinesterase. The detection can be accomplished by 

20 colorimetric methods which employ a chromogenic substrate for 
the enzyme. Detection may also be accomplished by visual 
comparison of the extent of enzymatic reaction of a substrate 
in comparison with similarly prepared standards. 

Detection may also be accomplished using any of a 

25 variety of other immunoassays. For example, by radioactively 
labeling the antibodies or antibody fragments, it is possible 
to detect tub gene peptides through the use of a 
radioimmunoassay (RIA) (see, for example, Weintraub, B., 
Principles of Radioimmunoassays, Seventh Training Course on 

30 Radioligand Assay Techniques, The Endocrine Society, March, 
1986, which is incorporated by reference herein) . The 
radioactive isotope can be detected by such means as the use 
of a gamma counter or a scintillation counter or by 
au t or ad i ogr aphy . 

35 It is also possible to label the antibody with a 

fluorescent compound. When the f luorescently labeled 
antibody is exposed to light of the proper wave length, its 
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presence can then be detected due to fluorescence. Among the 
most commonly used fluorescent labeling compounds are 
fluorescein isothiocyanate, rhodamine, phycoerythrin, 
phycocyanin, allophycocyanin, o-phthaldehyde and 
5 f luorescamine . 

The antibody can also be detectably labeled using 
fluorescence emitting metals such as ^^^Eu, or others of the 
lanthanide series. These metals can be attached to the 
antibody using such metal chelating groups as 

10 diethylenetriaminepentacetic acid (DTPA) or 
ethylenediaminetetraacetic acid (EDTA) . 

The antibody also can be detectably labeled by coupling 
it to a chemiluminescent compound. The presence of the 
chemiluminescent -tagged antibody is then determined by 

15 detecting the presence of luminescence that arises during the 
course of a chemical reaction. Examples of particularly 
useful chemiluminescent labeling compounds are luminol, 
isoluminol, theromatic acridinium ester, imidazole, 
acridinium salt and oxalate ester. 

20 Likewise, a bioluminescent compound may be used to label 

the antibody of the present invention, Bioluminescence is a 
type of chemiluminescence found in biological systems in, 
which a catalytic protein increases the efficiency of the 
chemiluminescent reaction. The presence of a bioluminescent 

25 protein is determined by detecting the presence of 
luminescence. Important bioluminescent compounds for 
purposes of labeling are luciferin, lucif erase and aequorin. 

5.4.2. SCREENING ASSAYS FOR COMPOUNDS 
30 THAT MODULATE tub GENE ACTIVITY 

The following assays are designed to identify compounds 
that bind to tub gene products, bind to other intracellular 
proteins that interact with a tub gene product, to compounds 
that interfere with the interaction of the tub gene product 
35 with other intracellular proteins and to compounds which 
modulate the activity of tub gene ( i.e. , modulate the level 
of tub gene expression and/or modulate the level of tub gene 
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product activity) . Assays may additionally be utilized which 
identify compounds which bind to tub gene regulatory 
sequences (e.g. , promoter sequences). See e.g. . Piatt, K.A., 
1994, J, Biol. Chem. 261:28558-28562, which is incorporated 
5 herein by reference in its entirety, which may modulate the 
level of tub gene expression. Compounds may include, but are 
not limited to, small organic molecules which are able to 
cross the blood-brain barrier, gain entry into an appropriate 
cell and affect expression of the tub gene or some other ^ene 

10 involved in the body weight regulatory pathway, or other 
intracellular proteins. Methods for the identification of 
such intracellular proteins are described, below, in Section 
5.4.2.2. Such intracellular proteins may be involved in the 
control and/or regulation of body weight. Further, among 

15 these compounds are compounds which affect the level of tub 
gene expression and/or tub gene product activity and which 
can be used in the therapeutic treatment of body weight 
disorders, including obesity, cachexia and anorexia, as 
described, below, in Section 5.4.3. 

20 Compounds may include, but are not limited to, peptides 

such as, for example, soluble peptides, including but not 
limited to, Ig- tailed fusion peptides, and members of random 
peptide libraries; (see, e.g. . Lam, K.S. et al . , 1991, Nature 
354:82-84; Houghten, R. et al . 1991, Nature 354:84-86), and 

25 combinatorial chemistry-derived molecular library made of D- 
and/or L- configuration amino acids, phosphopeptides 
(including, but not limited to members of random or partially 
degenerate, directed phosphopeptide libraries; see, e.g. . 
Songyang, Z. et al . , 1993, Cell 72:767-778), antibodies 

30 (including, but not limited to, polyclonal, monoclonal, 
humanized, anti- idiotypic, chimeric or single chain 
antibodies, and FAb, F{ab')2 and FAb expression library 
fragments, and epitope -binding fragments thereof), and small 
organic or inorganic molecules. 

35 Compounds identified via assays such as those described 

herein may be useful, for example, in elaborating the 
biological function of the tub gene product, and for 
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ameliorating body weight disorders. Assays for testing the 
effectiveness of compounds, identified by, for example, 
techniques such as those described in Section 5.4.2.1- 
5.4.2.3, are discussed, below, in Section 5.4.2.4. 

5 

5.4.2.1. IN VITRO SCREENING ASSAYS FOR COMPOUNDS 

THAT BIN D TO THE tub GENE PRODUCT 

In vitro systems may be designed to identify compounds 

capable of binding the tub gene products of the invention. 

Compounds identified may be useful, for example, in 

modulating the activity of wild type and/or mutant Jaib gene 

products, may be useful in elaborating the biological 

function of the tufe gene product, may be utilized in screens 

for identifying compounds that disrupt normal tub gene 

product interactions, or may in themselves disrupt such 

interactions . 

The principle of the assays used to identify compounds 
that bind to the tub gene product involves preparing a 
reaction mixture of the tub gene product and the test 

2Q compound under conditions and for a time sufficient to allow 
the two components to interact and bind, thus forming a 
complex which can be removed and/or detected in the reaction 
mixture. These assays can be conducted in a variety of ways. 
For example, one method to conduct such an assay would 

25 involve anchoring Jtjob gene product or the test substance onto 
a solid phase and detecting tub gene product/test compound 
complexes anchored on the solid phase at the end of the 
reaction. In one embodiment of such a method, the £iib gene 
product may be anchored onto a solid surface, and the test 

30 compound, which is not anchored, may be labeled, either 
directly or indirectly. 

In practice, microtiter plates may conveniently be 
utilized as the solid phase. The anchored component may be 
immobilized by non-covalent or covalent attachments. Non- 

3g covalent attachment may be accomplished by simply coating the 
solid surface with a solution of the protein and drying. 
Alternatively, an immobilized antibody, preferably a 
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monoclonal antibody, specific for the protein to be 
immobilized may be used to anchor the protein to the solid 
surface. The surfaces may be prepared in advance and stored. 
In order to conduct the assay, the nonimmobilized 
5 component is added to the coated surface containing the 
anchored component. After the reaction is complete, 
unreacted components are removed ( e.g. . by washing) under 
conditions such that any complexes formed will remain 
immobilized on the solid surface. The detection of complexes 

10 anchored on the solid surface can he accomplished in a number 
of ways. Where the previously nonimmobilized component is 
P3re-labeled, the detection of label immobilized on the 
surface indicates that complexes were formed- Where the 
previously nonimmobilized component is not pre- labeled, an 

15 indirect label can be used to detect complexes anchored on 
the surface; e.g. . using a labeled antibody specific for the 
previously nonimmobilized component (the antibody, in turn, 
may be directly labeled or indirectly labeled with a labeled 
anti-Ig antibody) . 

20 Alternatively, a reaction can be conducted in a liquid 

phase, the reaction products separated from unreacted 
components, and complexes detected; e.g. . using an 
immobilized antibody specific for tub gene product or the 
test compound to anchor any complexes formed in solution, and 

25 a labeled antibody specific for the other component of the 
possible complex to detect anchored complexes. 

5.4-2.2. ASSAYS FOR INTRACELLULAR PROTEINS 

THAT INTERACT WITH THE TUB GENE PRODUCT 

30 Any method suitable for detecting protein-protein 

interactions may be employed for identifying tub protein- 
intracellular protein interactions. 

Among the traditional methods which may be employed are 
co-immunoprecipitation, crosslinking and co-purification 

35 through gradients or chromatographic columns. Utilizing 
procedures such as these allows for the identification of 
intracellular proteins which interact with tub gene produces. 
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Once isolated, such an intracellular protein can be 
identified and can, in turn, be used, in conjunction with 
standard techniques, to identify proteins it interacts with. 
For example, at least a portion of the amino acid sequence of 
5 the intracellular protein which interacts with the tub gene 
product can be ascertained using techniques well known to 
those of skill in the art, such as via the Edman degradation 
technique (see, e.g. , Creighton, 1983, "Proteins: Structures 
and Molecular Principles", W.H. Freeman & Co., N.Y. , pp.34- 

10 49) . The amino acid sequence obtained may be used as a guide 
for the generation of oligonucleotide mixtures that can be 
used to screen for gene sequences encoding such .intracellular 
proteins. Screening made be accomplished, for example, by 
standard hybridization or PGR techniques. Techniques for the 

15 generation of oligonucleotide mixtures and the screening are 
well-known. (See, e.g. , Ausubel, supra . . and PGR Protocols: 
A Guide to Methods and Applications, 1990, Innis, M. et al., 
eds. Academic Press, Inc., New York). 

Additionally, methods may be employed which result in 

20 the simultaneous identification of genes which encode the 
intracellular protein interacting with the tub protein. 
These methods include, for example, probing expression 
libraries with labeled tub protein, using tub protein in a 
manner similar to the well known technique of antibody 

25 probing of Xgtll libraries. 

One method which detects protein interactions jji vivo , 
the two-hybrid system, is described in detail for 
illustration only and not by way of limitation- One version 
of this system has been described (Ghien et al., 1991, Proc. 

30 Natl. Acad. Sci. USA, 88:9578-9582) and is commercially 
available from Glontech (Palo Alto, CA) . 

Briefly, utilizing such a system, plasmids are 
constructed that encode two hybrid proteins: one consists of 
the DNA-binding domain of a transcription activator protein 

35 fused to the tub gene product and the other consists of the 
transcription activator protein's activation domain fused to 
an unknown protein that is encoded by a cDNA which has been 



- 48 - 



wo 97/02048 



PCT/US96/11186 



recombined into this plastnid as part of a cDNA library. The 
DNA-binding domain fusion plasmid and the cDNA library are 
transformed into a strain of the yeast Saccharomvces 
cerevisiae that contains a reporter gene ( e.g. . HBS or lacZ ) 
5 whose regulatory region contains the transcription 

activator's binding site. Either hybrid protein alone cannot 
activate transcription of the reporter gene: the DNA-binding 
domain hybrid cannot because it does not provide activation 
function and the activation domain hybrid cannot because it 
10 cannot localize to the activator's binding sites. 

Interaction of the two hybrid proteins reconstitutes the 
functional activator protein and results in expression of the 
reporter gene, which is detected by an assay for the reporter 
gene product . 

15 The two-hybrid system or related methodology may be used 

to screen activation domain libraries for proteins that 
interact with the "bait" gene product. By way of example, 
and not by way of limitation, tub gene products may be used 
as the bait gene product . Total genomic or cDNA sequences 

20 are fused to the DNA encoding an activation domain. This 
library and a plasmid encoding a hybrid of a bait tub gene 
product fused to the DNA-binding domain are cotransformed 
into a yeast reporter strain, and the resulting transformants 
are screened for those that express the reporter gene . For 

25 example, and not by way of limitation, a bait tub gene 

sequence, such as the 1.5 kb open reading frame of the tub 
gene, as depicted in FIG. 6 or FIG. 9 can be cloned into a 
vector such that it is translationally fused to the DNA 
encoding the DNA-binding domain of the GAL4 protein. These 

30 colonies are purified and the library plasmids responsible 
for reporter gene expression are isolated. DNA sequencing is 
then used to identify the proteins encoded by the library 
plasmids . 

A cDNA library of the cell line from which proteins that 
35 interact with bait tub gene product are to be detected can be 
made using methods routinely practiced in the art. According 
to the particular system described herein, for example, the 
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cDNA fragments can be inserted into a vector such that they 
are translationally fused to the transcriptional activation 
domain of GAL4 . This library can be co-transformed along 
with the bait tub gene-GAL4 fusion plasmid into a yeast 
5 strain which contains a lacZ gene driven by a promoter which 
contains GAL4 activation sequence. A cDNA encoded protein, 
fused to GAL4 transcriptional activation domain, that 
interacts with bait tub gene product will reconstitute an 
active GAL4 protein and thereby drive expression of the H'^SS 

10 gene. Colonies which express HIS3 can be detected by their 
growth on petri dishes containing semi -sol id agar based media 
lacking histidine. The cDNA can then be purified from these 
strains, and used to produce and isolate the bait tub gene- 
interacting protein using techniques routinely practiced in 

15 the art . 



5.4.2.3. ASSAYS FOR COMPOUNDS THAT INTERFERE 
.WITH TUB GENE PRODUCT/ INTRACELLULAR 
MACROMOLECULE INTERACTION 

The tub gene products of the invention may, in vivo, 
interact with one or more intracellular macromolecules, such 
as proteins. Such macromolecules may include, but are not 
limited to, nucleic acid molecules and those proteins 
identified via methods such as those described, above, in 
Section 5.4.2.2. For purposes of this discussion, such 
intracellular macromolecules are referred to herein as 
"binding partners". Compounds that disrupt tub binding in 
this way may be useful in regulating the activity of the tub 
gene product, especially mutant tub gene products. Such 
compounds may include, but are not limited to molecules such 
as peptides, and the like, as described, for example, in 
Section 5.4.2.1. above, which would be capable of gaining 
access to the intracellular tub gene product. 

The basic principle of the assay systems used to 
identify compounds that interfere with the interaction 
between the tub gene product and its intracellular binding 
partner or partners involves preparing a reaction mixture 
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containing the tufe gene product, and the binding partner 
under conditions and for a time sufficient to allow the two 
to interact and bind, thus forming a complex. In order to 
test a compound for inhibitory activity, the reaction mixture 
5 is prepared in the presence and absence of the test compound. 
The test compound may be initially included in the reaction 
mixture, or may be added at a time subsequent to the addition 
of tub gene product and its intracellular binding partner. 
Control reaction mixtures are incubated without the test 

10 compound or with a placebo. The formation of any complexes 
between the tub gene protein and the intracellular binding 
partner is then detected. The formation of a complex in the 
control reaction, but not in the reaction mixture containing 
the test compound, indicates that the compound interferes 

15 with the interaction of the tub gene protein and the 

interactive binding partner. Additionally, complex formation 
within reaction mixtures containing the test compound and 
normal tub gene protein may also be compared to complex 
formation within reaction mixtures containing the test 

20 compound and a mutant tub gene protein. This comparison may 
be important in those cases wherein it is desirable to 
identify compounds that disrupt interactions of mutant but 
not normal tub gene proteins . 

The assay for compounds that interfere with the 

25 interaction of the tub gene products and binding partners can 
be conducted in a heterogeneous or homogeneous format . 
Heterogeneous assays involve anchoring either the tub gene 
product or the binding partner onto a solid phase and 
detecting complexes anchored on the solid phase at the end of 

30 the reaction. In homogeneous assays, the entire reaction is 
carried out in a liquid phase. In either approach, the order 
of addition of reactants can be varied to obtain different 
information about the compounds being tested. For example, 
test compounds that interfere with the interaction between 

35 the tub gene products and the binding partners, e.g. , by 

competition, can be identified by conducting the reaction in 
the presence of the test substance; i.e. , by adding the test 
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substance to the reaction mixture prior to or simultaneously 
with the £ufe gene protein and interactive intracellular 
binding partner. Alternatively, test compounds that disrupt 
preformed complexes, e.g. compounds with higher binding 
5 constants that displace one of the components from the 
. complex, can be tested by adding the test compound to the 
reaction mixture after complexes have been formed. The 
various formats are described briefly below. 

In a heterogeneous assay system, either the £ub gene 

10 product or the interactive intracellular binding partner, is 
anchored onto a solid surface, while the non-anchored species 
is labeled, either directly or indirectly. In practice, 
raicrotiter plates are conveniently utilized. The anchored 
species may be immobilized by non-covalent or covalent 

15 attachments. Non-covalent attachment may be accomplished 
simply by coating the solid surface with a solution of the 
ijife gene product or binding partner and drying. 
Alternatively, an immobilized antibody specific for the 
species to be anchored may be used to anchor the species to 

20 the solid surface. The surfaces may be prepared in advance 
and stored. 

In order to conduct the assay, the partner of the 
immobilized species is exposed to the coated surface with or 
without the test compound. After the reaction is complete, 

25 unreacted components are removed ( e.g. . by washing) and any 
complexes formed will remain immobilized on the solid 
surface. The detection of complexes anchored on the solid 
surface can be accomplished in a number of ways. Where the 
non- immobilized species is pre-labeled, the detection of 

30 label immobilized on the surface indicates that complexes 
were formed. Where the non- immobilized species is not pre- 
labeled, an indirect label can be used to detect complexes 
anchored on the surface; e.g. . using a labeled antibody 
specific for the initially non-immobilized species (the 

35 antibody, in turn, may be directly labeled or indirectly 

labeled with a labeled anti-Ig antibody) . Depending upon the 
order of addition of reaction components, test compounds 
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which inhibit complex formation or which disrupt preformed 
complexes can be detected. 

Alternatively, the reaction can be conducted in a liquid 
phase in the presence or absence of the test compound, the 
5 reaction products separated from unreacted components, and 
complexes detected; e.o, , using an immobilized antibody 
specific for one of the binding components to anchor any 
complexes formed in solution, and a labeled antibody specific 
for the other partner to detect anchored complexes. Again, 

10 depending upon the order of addition of reactants to the 
liquid phase, test compounds which inhibit complex or which 
disrupt preformed complexes can be identified. 

In an alternate embodiment of the invention, a 
homogeneous assay can be used. In this approach, a preformed 

15 complex of the tub gene protein and the interactive 

intracellular binding partner is prepared in which either the 
£ub gene product or its binding partners is labeled, but the 
signal generated by the label is quenched due to complex 
formation (see, e.g. . U.S. Patent No. 4,109,4 96 by Rubenstein 

20 which utilizes this approach for immunoassays) . The addition 
of a test substance that competes with and displaces one of 
the species from the preformed complex will result in the 
generation of a signal above background. In this way, test 
substances which disrupt tub gene protein/intracellular 

25 binding partner interaction can be identified. 

In a particular embodiment, the tub gene product can be 
prepared for immobilization using recombinant DNA techniques 
described in Section 5.2. above. For example, the tub coding 
region can be fused to a glutathione-S-transf erase (GST) gene 

30 using a fusion vector, such as pGEX-5X-l, in such a manner 
that its binding activity is maintained in the resulting 
fusion protein. The interactive intracellular binding 
partner can be purified and used to raise a monoclonal 
antibody, using methods routinely practiced in the art and 

35 described above, in Section 5.3. This antibody can be 
labeled with the radioactive isotope ^^^I, for example, by 
methods routinely practiced in the art. In a heterogeneous 
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assay, e.g. , the GST- tub fusion protein can be anchored to 
glutathione-agarose beads. The interactive intracellular 
binding partner can then be added in the presence or absence 
of the test compound in a manner that allows interaction and 
5 binding to occur. At the end of the reaction period, unbound 
material can be washed away, and the labeled monoclonal 
antibody can be added to the system and allowed to bind to 
the complexed components. The interaction between the tub 
gene protein and the interactive intracellular binding 
10 partner can be detected by measuring the amount of 

radioactivity that remains associated with the glutathione- 
agarose beads. A successful inhibition of the interaction by 
the test compound will result in a decrease in measured 
radioactivity. 

15 Alternatively, the GST-t]ib gene fusion protein and the 

interactive intracellular binding partner can be mixed 
together in liquid in the absence of the solid glutathione- 
agarose beads. The test compound can be added either during 
or after the species are allowed to interact. This mixture 

20 can then be added to the glutathione-agarose beads and 
unbound material is washed away. Again the extent of 
inhibition of the tub gene product/binding partner 
interaction can be detected by adding the labeled antibody 
and measuring the radioactivity associated with the beads. 

25 In another embodiment of the invention, these same 

techniques can be employed using peptide fragments that 
correspond to the binding domains of the tub protein and/or 
the interactive intracellular or binding partner (in cases 
where the binding partner is a protein) , in place of one or 

30 both of the full length proteins. Any number of methods 
routinely practiced in the art can be used to identify and 
isolate the binding sites. These methods include, but are 
not limited to, mutagenesis of the gene encoding one of the 
proteins and screening for disruption of binding in a co- 

35 immunoprecipitation assay. Compensating mutations in the 
gene encoding the second species in the complex can then be 
selected. Sequence analysis of the genes encoding the 
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respective proteins will reveal the mutations that correspond 
to the region of the protein involved in interactive binding. 
Alternatively, one protein can be anchored to a solid surface 
using methods described in this Section above, and allowed to 
5 interact with and bind to its labeled binding partner, which 
has been treated with a proteolytic enzyme, such as trypsin. 
After washing, a short, labeled peptide comprising the 
binding domain may remain associated with the solid material, 
which can be isolated and identified by amino acid 
10 sequencing. Also, once the gene coding for the intracellular 
binding partner is obtained, short gene segments can be 
engineered to express peptide fragments of the protein, which 
can then be tested for binding activity and purified or 
synthesized. 

15 For example, and not by way of limitation, a tub gene 

product can be anchored to a solid material as described, 
above, in this Section by making a GST- tub fusion protein and 
allowing it to bind to glutathione agarose beads. The 
interactive intracellular binding partner can be labeled with 

20 a radioactive isotope, such as «S, and cleaved with a 

proteolytic enzyme such as trypsin. Cleavage products can 
then be added to the anchored GST-iJib fusion protein and 
allowed to bind. After washing away unbound peptides, 
labeled bound material, representing the intracellular 

25 binding partner binding domain, can be eluted, purified, and 
analyzed for amino acid sequence by well-known methods. 
Peptides so identified can be produced synthetically or fused 
to appropriate facilitative proteins using recombinant DNA 
technology. 

30 

5.4.2.4. ASSAYS FOR IDENTIFICATION OP COMPOUNDS 
THAT AMELIORATE RO DY WEIGHT DISORDERS 

Compounds-, including but not limited to binding 
compounds identified via assay techniques such as those 
35 described, above, in Sections 5.4.2.1-5.4.2.3, can be tested 
for the ability to ameliorate body weight disorder symptoms, 
including obesity. It should be noted that although t^ gene 
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products are intracellular molecules which are not secreted 
and have no transmembrane component, the assays described 
herein can identify compounds which affect gene activity 
by either af.fecting ^ gene expression or by affecting the 
5 level of tub gene product activity. For example, compounds 
may be identified which are involved in another step in the 
pathway in which the £ub gene and/or £ub gene product is 
involved and, by affecting this same pathway may modulate the 
affect of tub on the development of body weight disorders. 
10 Such compounds can be used as part of a therapeutic method 
for the treatment of body weight disorders. 

Described below are cell-based and animal model-based 
assays for the identification of compounds exhibiting such an 
ability to ameliorate body weight disorder symptoms. 
15 First, cell -based systems can be used to identify 

compounds which may act to ameliorate body weight disorder 
symptoms. Such cell systems can include, for example, 
recombinant or non- recombinant cell, such as cell lines, 
which express the tub gene. For example, hypothalamus cells, 
20 such as, for example GH-l (Melcang; R.C. et al . , 1995, 

Endocrinology 136 :-67 9 -686) and GN (Radovick, S. et al!, 1991, 
Proc. Natl. Acad. Sci. USA lfi:3402-3406) hypothalamic cell 
lines can be used. 

In utilizing such cell systems, cells may be exposed to 
25 a compound, suspected of exhibiting an ability to ameliorate 
body weight disorder symptoms, at a sufficient concentration 
and for a time sufficient to elicit such an amelioration of 
body weight disorder symptoms in the exposed cells. After 
exposure, the cells can be assayed to measure alterations in 
30 the expression of the ijife gene, e^, by assaying cell 

lysates for ijab mRNA transcripts (g^, by Northern analysis) 
or for tub protein expressed in the cell; compounds which 
increase expression of the tub gene are good candidates as 
therapeutics. Alternatively, the cells are examined to 
35 determine whether one or more body weight disorder- lilce 
cellular phenotypes has been altered to resemble a more 
normal or more wild type, non-body weight disorder phenotype, 
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or a phenotype more likely to produce a lower incidence or 
severity of disorder syn5)tonis. 

In addition, animal -based body weight disorder systems, 
which may include, for example Jtufe mice, may be used to 
5 identify compounds capable of ameliorating body weight 
disorder-like symptoms. Such animal models may be used as 
test substrates for the identification of drugs, 
pharmaceuticals, therapies and interventions which may be 
effective in treating such disorders. For example, animal 

10 models may be exposed to a compound, suspected of exhibiting 
an ability to ameliorate body weight disorder symptoms, at a 
sufficient concentration and for a time sufficient to elicit 
such an amelioration of body weight disorder symptoms in the 
exposed animals. The response of the animals to the exposure 

15 may be monitored by assessing the reversal of disorders 

associated with body weight disorders such as obesity. With 
regard to intervention, any treatments which reverse any 
aspect of body weight disorder-like symptoms should be 
considered as candidates for human body weight disorder 

20 therapeutic intervention. Dosages of test agents may be 

determined by deriving dose -response curves, as discussed in 
Section 5.5.1, below. 



5.4.3. COMPOUNDS AND METHODS FOR THE TREATMENT 
25 OF BODY WEIGHT 

Described below are methods and compositions whereby 
body weight including body weight disorders, including 
obesity, cachexia and anorexia may be treated. Because a 
loss of normal tub gene product function results in the 

30 development of an obese phenotype, an increase in gene 
product activity would facilitate progress towards a normal 
body weight state in individuals exhibiting a deficient level 
of inte gene expression and/or tub gene product activity. 

Alternatively, symptoms of certain body weight disorders 

35 such as, for example, cachexia, which involve a lower than 
normal body weight phenotype, may be ameliorated by 
decreasing the level of £ufe gene expression and/or £jib gene 
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product activity. For example, tub gene sequences may be 
utilized in conjunction with well-known antisense. gene 
"knock-out," ribozyme and/or triple helix methods to decrease 
the level of £ub gene expression. Such methods can also be 
5 useful for agricultural applications in which a more 

favorable fat: level body mass ratio (j^, a decreased ratio) 
is desired. 

With respect to an increase in the level of normal ^ 
gene expression and/or tub gene product activity, tub gene 
10 nucleic acid sequences, described, . above, in Section 5.i, 
can, for example, be utilized for the treatment of body ' 
weight disorders, including obesity, such treatment can be 
administered, for example, in the form of gene replacement 
therapy. Specifically, one or more copies of a normal ^uh 
15 gene or a portion of the ^ gene that directs the production 
of a tMb gene product exhibiting normal ^ gene function, 
may be inserted into the appropriate cells within a patient, 
using vectors which include, but are not limited to 
adenovirus, adeno-associated virus, and retrovirus vectors, 
20 in addition to other particles that introduce DNA into cells, 
such as liposomes. 

Because the ^ gene is expressed in the brain, 
including the hypothalamus, such gene replacement therapy 
techniques should be capable delivering £ub gene sequences to 
25 these cell types within patients. Thus, the techniques for 
delivery of fiufe gene sequences should be able to readily 
cross the blood-brain barrier, which are well known to those 
of skill in the art (see, g^, pct application, publication 
No. WO89/10134, which is incorporated herein by reference in 
30 its entirety), or, alternatively, should involve direct 

administration of such ^ gene sequences to the site of the 
cells in which the £ub gene sequences are to be expressed. 
With respect to delivery which is capable of crossing the 
blood-brain barrier, viral vectors. such as, for example, 
35 those described above, are preferable. 

Additional methods which may be utilized to increase the 
overall level of ^ gene expression and/or ^ gene product 
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activity include the introduction of appropriate tub- 
expressing cells, preferably autologous cells, into a patient 
at positions and in numbers which are sufficient to 
ameliorate the symptoms of body weight disorders, including 
5 obesity. Such cells may be either recombinant or non- 
recombinant . 

Among the cells which can be administered to increase 
the overall level of tub gene expression in a patient are 
normal cells, preferably hypothalamus cells, which express 

10 the tub gene. Among the hypothalamic cells which can be 
administered are hypothalamic cell lines, which include, but 
are not limited to the GTl-i cell line (Melcangi, R.c. et 
al., 1995, Endocrin. 136:679-686). 

Alternatively, cells, preferably autologous cells, can 

15 be engineered to express tub gene sequences which may then be 
introduced into a patient in positions appropriate for the 
amelioration of body weight disorder symptoms. Alternately, 
cells which express the tjib gene in a wild type in MHC 
matched individuals, i.e. . non-tub individual, and may 

20 include, for example, hypothalamic cells. The expression of 
the £ub gene sequences is controlled by the appropriate gene 
regulatory sequences to allow such expression in the 
necessary cell types. Such gene regulatory sequences are 
well known to the skilled artisan. Such cell -based gene 

25 therapy techniques are well known to those skilled in the 
art, see, s.^, Anderson, F., U.S. Patent No. 5,399,349. 

When the cells to be administered are non- autologous 
cells, they can be administered using well known techniques 
which prevent a host immune response against the introduced 

30 cells from developing. For example, the cells may be 

introduced in an encapsulated form which, while allowing for 
an exchange of components with the immediate extracellular 
environment, does not allow the introduced cells to be 
recognized by the host immune system. 

35 Additionally, compounds, such as those identified via 

techniques such as those described, above, in Section 5.4.2, 
which are capable of modulating tub gene product activity can 
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be administered using standard techniques which are well 
known to those of skill in the art. In instances in which 
the compounds to be administered are to involve an 
interaction with brain cell types such as, for example, 
5 hypothalamic cell types, the administration techniques should 
include well known ones which allow for a crossing of the 
blood-brain barrier. 

5.5. PHARMACEUTICAL PREPARATIONS AND 
METHODS OF ADMINISTRATION 

The compounds that are determined to affect tub gene 
expression or gene product activity can be administered to a 
patient at therapeutically effective doses to treat or 
ameliorate weight disorders, including obesity. A 
15 therapeutically effective dose refers to that amount of the 
compound sufficient to result in amelioration of symptoms of 
body weight disorders. 

5.5.1. EFFECTIVE DQSE 

20 Toxicity and therapeutic efficacy of such compounds can 

be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, e.g. , for determining the 
LD50 (the dose lethal to 50% of the population) and the ED50 
(the dose therapeutically effective in 50% of the 

25 population) . The dose ratio between toxic and therapeutic 
effects is the therapeutic index and it can be expressed as 
the ratio LD50/ED50. Compounds which exhibit large therapeutic 
indices are preferred. While compounds that exhibit toxic 
side effects may be used, care should be taken to design a 

30 delivery system that targets such compounds to the site of 
affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 

The data obtained from the cell culture assays and 
animal studies can be used in formulating a range of dosage 

35 for use in humans. The dosage of such compounds lies 

preferably within a range of circulating concentrations that 
include the ED50 with little or no toxicity. The dosage may 
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vary within this range depending upon the dosage form 
employed and the route of administration utilized. For any 
compound used in the method of the invention, the 
therapeutically effective dose can be estimated initially 
5 from cell culture assays. A dose may be formulated in animal 
■ models to achieve a circulating plasma concentration range 
that includes the IC50 (i.e. , the concentration of the test 
compound which achieves a half -maximal inhibition of 
symptoms) as determined in cell culture. Such information 
10 can be used to more accurately determine useful doses in 
humans. Levels in plasma may be measured, for example, by 
high performance liquid chromatography. 

5.5,2. FORMULATIONS AND ll^V. 
15 Pharmaceutical compositions for use in accordance with 

the present invention may be formulated in conventional 
manner using one or more physiologically acceptable carriers 
or excipients. 

Thus, the compounds and their physiologically acceptable 

20 salts and solvates may be formulated for administration by 
inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

For oral administration, the pharmaceutical compositions 
may take the form of, for example, tablets or capsules 

25 prepared by conventional means with pharmaceutically 
acceptable excipients such as binding agents ( e.g. , 
pregelatinised maize starch, polyvinylpyrrolidone or 
hydroxypropyl methylcellulose) ; fillers ( e.g. , lactose, 
microcrystalline cellulose or calcium hydrogen phosphate) ; 

30 lubricants ( e.g. . magnesium stearate, talc or silica); 
disintegrants ( e.g. . potato starch or sodium starch 
giycolate) ; or wetting agents ( e.g. . sodium lauryl sulphate) . 
The tablets may be coated by methods well known in the art. 
Liquid preparations for oral administration may take the form 

35 of, for example, solutions, syrups or suspensions, or they 
may be presented as a dry product for constitution with water 
or other suitable vehicle before use. Such liquid 
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preparations may be prepared by conventional means with 
pharmaceutically acceptable additives such as suspending 
agents (e.g. . sorbitol syrup, cellulose derivatives or 
hydrogenated edible fats) ; emulsifying agents ( e,a. . lecithin 
5 or acacia) ; non- aqueous vehicles ( e.g. , almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and 
preservatives (e.g. . methyl or propyl -p-hydroxybenzoates or 
sorbic acid) . The preparations may also contain buffer 
salts, flavoring, coloring and sweetening agents as 
.10 appropriate. 

Preparations for oral administration may be suitably 
formulated to give controlled release of the active compound. 

For buccal administration the compositions may take the 
form of tablets or lozenges formulated in conventional 
15 manner. 

For administration by inhalation, the compounds for use 
according to the present invention are conveniently delivered 
in the form of an aerosol spray presentation from pressurized 
packs or a nebuliser, with the use of a suitable propellant, 

20 e.g. , dichlorodifluorome thane, trichlorof luoromethane, 

dichlorotetraf luoroethane, carbon dioxide or other suitable 
gas. In the case of a pressurized aerosol the dosage unit 
may be determined by providing a valve to deliver a metered 
amount. Capsules and cartridges of e.g. gelatin for use in 

25 an inhaler or insufflator may be formulated containing a 

powder mix of the compound and a suitable powder base such as 
lactose or starch. 

The compounds may be formulated for parenteral 
administration by injection, e.g. . by bolus injection or 

30 continuous infusion. Formulations for injection may be 

presented in unit dosage form, e.g. . in ampoules or in multi- 
dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or 
emulsions in oily or aqueous vehicles, and may contain 

35 formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may 
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be in powder form for constitution with a suitable vehicle, 

e.g. , sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal 

compositions such as suppositories or retention enemas, e,a, . 
5 containing conventional suppository bases such as cocoa 

butter or other glycerides. 

In addition to the formulations described previously, 

the compounds may also be formulated as a depot preparation. 

Such long acting formulations may be administered by 
10 implantation (for example subcutaneously or intramuscularly) 

or by intramuscular injection. Thus, for example, the 

compounds may be formulated with suitable polymeric or 

hydrophobic materials (for example as an emulsion in an 

acceptable oil) or ion exchange resins, or as sparingly 
15 soluble derivatives, for example, as a sparingly soluble 

salt . 

The compositions may, if desired, be presented in a pack 
or dispenser device which may contain one or more unit dosage 
forms containing the active ingredient. The pack may for 
20 example comprise metal or plastic foil, such as a blister 
pack. The pack or dispenser device may be accompanied by 
instructions for administration. 

6 . EXAMPLE: GENETIC MAPPING OF THE tub LOCUS 
25 In the Example presented herein, studies are described 

which, first, define the genetic interval within which the 
■tytp gene lies, and, second, successfully narrow the interval 
to approximately 0,25 cM. 

30 6.1. MATERIALS AND METHODS 

The tubbv phenotype. The tubby phenotype was assessed 
by weighing the mice. Females weighing less than 35 grams at 
150 days were classified as normal ( i.e. . either +/+ or 
tub/+) , while those weighing greater than 43 grams were typed 

35 as tub / tub . Males weighing more than 55 grams at 150 days 
were classified as tub / tub , while males weighing less than 55 
grams were classified as unknown. 
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The markers used to genotype the crosses were those 
identified and mapped at the Whitehead Institute at the 
Massachusetts Institute of Technology (Dietrich et al . , 1992, 
Genetics 131:423-447), 
5 The European backcross mapping panel (Breen et al., 

1994, Human Mol. Genet. 1:621-627), which consists of a 
C57BL/6J X Mus Soretus backcross, was used to order markers 
within the tub gene interval . 

Hbb protein polymorphism typing is described in Whitney, 
10 J.B, III, 1978, Biochem. Genet. 16:667-672. 

Mouse crosses were performed according to standard 
procedures , 

6.2. RESULTS 

15 The murine tub gene had previously been mapped to 2.4 cM 

+/- 1.4 cM distal of the hemoglobin beta locus (Hbb) on mouse 
chromosome 7 (Jones, J.M. et al . , 1992, Genomics 14:197-199). 

2.4 cM represents a genetic distance measurement 
corresponding to 3 observed genetic crossovers in 125 

20 opportunities. On average, in the mouse genome, this is 
equivalent to a physical distance of approximately 4.8 
million base pairs. This level of genetic resolution, 
however, was not satisfactory for the cloning of the tub 
gene. Further, the region of chromosome 7 containing the tub 

25 gene was not well defined, and no defined markers existed 
which flanked the tub locus. 

Described herein, therefore, are genetic crosses which: 
1) define the chromosomal region surrounding the tub gene, 
and 2) narrow the interval within which the tub gene is 

30 determined to lie to 0,25 cM. 

Specifically, two large crosses segregating the tubby 
phenotype were set up and performed, and were typed with 
available genetic markers known to map within the relevant 
region of chromosome 7. 

35 First, an intercross of (C57BL/6J-tub x DBA/2 J] F^ hybrid 

mice was set up. These hybrid mice were the progeny derived 
from the mating of two inbred stains, C57BL/6J- tub/tub and 
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DBA/2J-+/+. In total, 417 Fj progeny, representing 838 
independent meioses, were analyzed. Typing all informative 
markers against this cross identified a genomic region of 
approximately 4 megabases between the markers D7Mitl7 and 
5 D7Mit281 which contained the tub gene. Two F2 progeny showed 
recombination events between D7Mitl7 and the tub locus, 
thereby establishing this marker as proximal to the tub 
locus. Five recombinant F2 progeny demonstrated that the tub 
locus lies proximal to the D7Mit281 marker, thus placing u:.e 
10 tub locus between the D7Mitl7 and D7Mit281 markers, as shown 
in FIG. 1. The distance between the markers D7Mitl7 and 
D7Mit281 was determined to be about 2.0 cM, thereby narrowing 
the interval within which the tub gene must lie to this 2.0 
cM region. 

15 The tub genetic interval was further narrowed by 

exploiting a by product of the way in which tub stock is 
maintained, tub heterozygotes must be identified in order to 
easily maintain the stock because tub homozygotes have 
reduced fertility. In order to maintain such heterozygotes, 

20 a C57BL/6J-tub strain was crossed with the congenic C57BL/6J- 
Hbb^ strain. This congenic strain is presumed to be 
genetically identical to the C57BL/6J strain except for a 
genomic segment from a wild mouse strain surrounding and 
including the Hbb locus. As a- result, the C57BL/6 J-Hbb^ 

25 strain has an Hbb allele (Hbb^) which can be distinguished 
electrophoretically from the C57BL/6J Hbb allele (Hbb^) . 
Because the Hbb locus is closely linked to the tub locus, 
those animals found to be Hbb^/Hbbf were presumed to be 
heterozygous at the tub locus as well (a subset of animals 

30 were tested for the tubby phenotype later, to assure that no 
recombination between the Hbb and tub loci had taken place) . 

Because the two markers under selection for 
heterozygosity in such a maintenance scheme are Hbb and tub , 
the genomic region between these two loci also remains 

35 heterozygous as the stock is propagated. However, with each 
successive generation, this region will narrow, and the 
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region outside this interval will become homozygous for 
C57BL/6J alleles. 

By genotyping of the parental strains C57BL/6J and 
C57BL/6J-Hbb^, the boundaries of the original congenic 
5 interval surrounding the Hbb locus were established. 

Proximal of the tub locus, the congenic interval includes the 
markers D7Mitl7, 39, 33, 37 and 38. The congenic interval 
extends distally beyond the marker D7Mit222 and includes the 
markers D7Mitl30 and 53. 

10 The genotyping of the C57BL/6J-tub/+-abbf/Hbb! strains 

generated herein, led to the finding that the markers 
D7Mit39, 53 and 22 were homozygous for C57BL/6J alleles in 
each of the animals of this strain which were tested. This 
showed that the congenic interval had been narrowed, through 

15 subsequent generations, to an interval between D7Mit39 and 
D7Mit53 (D7Mit39 is 0.3 CM proximal to D7Mitl7) . Because the 
£ub locus is, by necessity, heterozygous in these animals, it 
must also, therefore, lie within this D7Mit39-to-D7Mit53 
interval. Based on the typing of 982 progeny of the European 

20 backcross mapping panel (Breen et al., 1994, Human Mol. 
Genet, 3:621-627)-, this interval was estimated to be 
approximately 0.5 cM. 

Next, the tub maintenance stock was used as a cross. 
Because heterozygous mice of this stock {C57BL/6J-tub/+) were 

25 heterozygous for markers within the congenic interval, such a 
cross represented an intercross segregating tubby in a 
manner analogous to the £ufe/DBA/2J intercross. 394 meioses 
were geno typed and a single recombinant mouse was identified, 
demonstrating that the tub locus lies proximal to the 

30 D7Mitl30 marker. Thus, at this point in the genetic mapping, 
the proximal boundary of the £ub interval was D7Mitl7, as 
defined by the recombinants isolated from the [C57BL/6J- tub x 
DBA/2J] Fi intercross and the distal boundary of the tub 
interval was D7Mitl3 0, as shown by the recombinant of this 

35 C57BL/6J-tub/+ intercross. The total number of meioses 
genotyped at this point was 1232: 838 meioses in the 
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[C57BL/6J-£iife X DBA/2 J] Pi intercross and 394 meioses in the 
maintenance stock intercross. 

The size of this region was estimated to be 
approximately 0.25 cM on the European backcross panel. On 
5 average in the mouse genome, such a genetic distance 

corresponds to a physical distance of approximately 500 kb. 
This finding led to efforts to clone, the intervening DNA in 
an attempt to isolate the tub gene. 

10 7. EXAMPLE: PHYSICAL MAPPING OF THE 

tub GENE INTERVAL 

As a step toward identifying the fiufe gene, the Example 
presented herein describes the physical mapping of the 
D7Mitl7 to D7Mit53 interval within which the £iib gene was 
determined to lie. 

7.1. MATERIALS AND METHODS 
Yeast artificial chromosom e (YAC) libraries . Two mouse 
genomic YAC libraries were screened in an effort to identify 

2Q specific YACs containing genomic DNA from the tub region. 

The first YAC library, the Whitehead Mouse YAC Library I, was 
obtained from Research Genetics (Huntsville, AL) . The second 
YAC library, the St. Mary's/ICRF YAC library, was a 
composite library made of YACs constructed at St, Mary's 

25 Hospital (London, England) and of YACs constructed at the 
Imperial Cancer Research Fimd laboratories and it was 
obtained from St. Mary's Hospital. 

The YAC libraries were screened by PCR amplification of 
DNA pools representing the libraries. A description of a 
screening protocol can be found in Research Genetics Catalog 
No. 95020. 

The terminal sequences of the YACs were isolated by 
vectorette PCR according to Riley et al., 1990, Nucl . Acids 
Res. i8: 2887-2890) . Sequencing was performed according to 
standard procedures. 

YAC ends were mapped according to the protocol described 
by Tuffrey et al., 1993, Hum. Mut. 2:368-374 for single- 
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Stranded conformational polymorphism (SSCP) analysis, using 
SSCPs identified between C57BL/6J and Mus spretus (the two 
mouse strains used to generate the European Backcross mapping 
panel). Utilizing the YAC end SSCPs it was possible to 
5 determine that the ends of the YACs mapped between the 
D7Mitl7 and D7Mit53 markers. 

PI bactey j.ophag^ . A mouse genomic PI bacteriophage 
library (Pierce, J.C, et al., 1992, Mamm. Genome 3:550-558) 
was screened using the Genome Systems screening service. For 
10 screening, the ura end of the M72 YAC (M72R) was identified 
via vectorette PGR (Riley et al., 1990, Nucl. Acids Res. 
ia;2887-2890) . M72R was sequenced and two PCR fragments were 
chosen from this sequence, as shown below: 

M72R-f : 5'-TGC 6CA GAA ACA ATC ACC TA-3' ; and 
M72R-r: 5'-CAA GAC GTG AAC CTG GA-3' 

The two primers amplify a 129 bp fragment from mouse genomic 

DNA. The primers were used by Genome Systems screening 

service to screen the mouse genomic PI library. 

Bacterial Arti ficial Chromosomes (BACs) . A MIT/Research 

2Q mouse BAC library obtained from Research Genetics (Catalog 

No. 96023) was screened according to manufacturer's suggested 

screening protocol. 

7.2 RESULTS 

25 Described herein are results which describe the physical 

mapping of the tub region. This region is shown in FIG. l. 
In PIG. 1, genetic markers are indicated above the top line, 
while YACs spanning the region are shown below this. The 
checkered Pl and BAC clones were analyzed by sequence 
sampling and exon trapping (see Section 8, below). Overlaps 
between clones were identified by PCR amplification of clones 
with physical markers in the region. The tub gene, as 
described, below, in this section, was mapped between D7Mitl7 
and D7Mit53. 

35 The markers D7Mit 127, 219, 63, 280, 236 and 130 were 
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mapped between the D7Mitl7 and D7Mit53 markers on the 
European Backcross panel (Breen et al., 1994, Human Mol . 
Genet. 3:621-627). These markers, including the D7Mitl7 and 
D7Mit53 markers, were used, therefore, to screen the MIT YAC 
5 library. 

Screening with these markers resulted in the 
identification of a set of YACs which constituted two 
contigs. Specifically, the contig around D7Mitl7 included 
YACs M65, M70 and M72, while the contig around D7Mit53 

10 included M4 9, M79 and M31. 

In order to clone the gap between the two YAC contigs, 
physical PGR markers at the ends of the YACs were 
established, via vectorette PGR (Riley, 1990, Nucl . Acids 
Res. 18:2887-2890), with which to rescreen the YAC library. 

15 The resulting PGR products were sequenced and PGR screening 
primers were chosen. The trp ends of YACs M70 and M31 were 
isolated (trp ends will be referred to herein as the left end 
of the YACs, e>q. , M70L, while the ura ends will be referred 
to herein as the right ends) , and were genetically mapped, as 

20 described, above, in Section 6.1, to the £ub region of mouse 
chromosome 7 in order to show that they were not derived from 
chimeric YACs. These ends were then used to screen the St. 
Mary's/ICRF YAC library. 

One YAC, M84 , was identified by both M70L and M31L. 

25 Thus, a single contig spanning the D7Mitl7 to D7Mit53 was 
established. The minimal contig consisted of M65, M72, M84 , 
M31, M79 and M49, as shown in FIG. 1. 

In order to further aid in gene identification and to 
confirm the integrity of the YAC contig described above, PI 

30 bacteriophage and bacterial artificial chromosomes (BAGs) 
were established for the interval between D7Mitl7 and 
D7Mitl30. These PI clones and BAGs overlap to form three 
contigs separated by two gaps, as shown in FIG, 1. 

35 8 . EXAMPLE: IDENTIFICATION OF A CANDIDATE tub GENE 

In the Example presented herein, a gene is identified, 
via exon trapping and sequence sampling, within the cloned 
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DNA described in the Example presented, above, in Section 1 , 
which corresponds to a candidate tub gene. Specifically, 
Section 8.1 describes the exon trapping and sequencing 
analyses, while Section 8.2 describes the cloning of putative 
5 tub gene cDNA clones, 

8.1. EXON TRAPPING AND SEQUENCE SAMPLING 
OF tub GENE INTERVAL DNA 

MATERIALS AND METHODS 
Eleven PI (Pi, P2, P3, P4, P6, P7, P8, PIO, Pll, Pi3 and 
P14) and twelve BAG (Bl, B2, B3, B4, B5, B6, B7, B9, B12, 
B13, B14 AND B15) clones were subcloned into the D-pSPL3, 
vector, exon trapped and sequence sampled, as described 
below. 

15 Exon trapping. The exon trapping analysis was performed 

using Gibco BRL Exon Trapping System (Cat. No. 18449-017) and 
using the D-pSPL3 vector, a modified version of the pSPL3 
vector (Gibco BRL Life Sciences) . In this system, exons are 
trapped from genomic DNA subcloned into the vector as a 

2Q result of the interaction between the vector splice site and 
splice sites flanking exons in the genomic DNA. 

D-pSPL3 was derived from the splicing vector pSPL3 
(Gibco BRL Life Sciences) by deletion of the Ndel (1119) -Nhel 
(1976) fragment in the HIV tat intron to eliminate the 

25 c^ryptic splice-donor site at position 1134 in the pSPL3 

sequence. Stocks of BamHI-cut and Pstl-cut D-pSPL3 DNA were 
prepared by digesting 50-100/zg DNA with the corresponding 
enzyme and dephosphorylating the linearized vector with calf 
intestinal alkaline phosphatase as specified by the 

3Q manufacturers (New England Biolabs and Boerhinger Mannheim, 
respectively) , The linearized vector was purified away from 
uncut plasmid DNA by agarose gel electrophoresis and 
electroelution and assayed to assess the level of uncut and 
self-ligated vector as. described elsewhere (Pulido and Duyk, 

2g 1994, in Current Protocols in Human Genetics, Wiley Pub., pp 
2. 2,1-2, 3.1) . 
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Briefly, Pi and BAG clone DNA was prepared from 
overnight cultures (100 ml LB/kanatnycin 25 /xg/ml) by standard 
alkaline lysis, treated with RNase A, purified by 
phenol/chloroform/isoamyl alcohol (25:24:1) extraction, 
5 ethanol precipitated, rinsed in 70% ethanol, dried and 
resuspended in 400/il deionized water. 

S-lO/ig Pl/BAC DNA was cut with either BamHI and Bglll, 
or PstI, as specified by the manufacturer (New England 
Biolabs, Beverly, MA) . The digested DNA was phenol 

10 extracted, ethanol precipitated and resuspended in 50 /il 
deionized water. 

Exon trapping was then completed as described in the 
Gibco BRL Exon Trapping Manual, Briefly, the D-pSPL3 clones 
were transfected into COS- 7 cells. RNA was isolated and 

15 first strand cDNA was synthesized. Two rounds of nested PGR 
specifically amplified transcripts derived from the D-pSPL3 
clones. These PGR products were cloned into the vector 
pAMPlO, Glones from this pAMPlO library of trapped fragments 
were then analyzed by PGR to determine insert sizes. Glones 

20 with insert sizes greater than 150 bp were sequenced using 
M13 forward and reverse primers. One of the D-pSPL3 
subclones was designated ium008p004, and was sequenced. 

A 90 bp fragment, designated P8X1, was PGR amplified 
using the sequence of this subclone insert . The P8X1 

25 fragment was generated using two PGR primers which were 

designed using the ium008p004 sequence as follows: 

P8X1F1: 5'-GGG GAT AGA GAG TGT GTG AT- 3' 
P8X1R1: 5 '-GAG GAG AAA TGT GCT AGG CT-3' 

The 90 bp P8X1 DNA fragment was PGR amplified from first 

strand cDNA made from C57BL/6J mouse brain RNA. Standard 

cDNA synthesis and PGR procedures were utilized. 

Sequence samplina . Sequence sampling is a technique for 

rapidly determining whether coding sequences were present in 

a nucleic acid sample of interest (See Glaverie, J.M., 1994, 

35 Genomics 23:575-581) . The inserts in D-pSPL3 clones 

described above were sequenced in both orientations using the 

following primers: 
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SPL3A: 5' -CAT GCT CCT TGG GAT GT-3' 
SPL3C: 5'-TGA GGA TTG CTT AAA GA-3' 

After vector trimming and quality assessment, the resulting 

sequences were compared to nucleic acid and protein databases 

g using BLAST algorithms (Altschul, S.F. et al . , 1990, J. Mol . 

Biol. 215:403-410) . 

RESULTS 

In order to look for genes within the cloned DNA, 

j^jj described, above, in Section 7, within the interval 

containing the tub gene, PI and BAC clones were subcloned 

into the D-pSPL3 vector and exon trapped and sample 

sequenced, as described, above, in the Materials and Methods 

portion of this section. One of the D-pSPL3 subclones, 

designated ium008p004, was derived from a D-pSPL3 library 

made from the P8 clone (see FIG. 1) . A 327 base pair portion 

of the PI insert in ium008p004 was sequenced. The protein 

sequence encoded by this portion of ium008p004 showed 

homology to two translated sequences in the GenBank nucleic 

2Q acid database. Two primers were selected from the region of 

homology and used to amplify a DNA fragment of 90 bp, called 

P8X1, having the following sequence: 

5'GAGACAAATG TCCTAGGCTT CAAGGGACCT CGGAAGATGA GTGTGATCGT 
CCCAGGCATG AACATGGTTC ATGAGAGAGT CTGTATCCGC 3' 



25 



30 



35 



The ium008p004 homologies were to Genbank sequences 
Z48334 and X69827. Z48334 is the partial sequence of a 
Caenorhabdi t i s eleqans cosmid, F10B5. One of the putative 
genes identified within this sequence contains a 425 amino 
acid open reading frame, designated F10B5.4 (Wilson R. et al, 
1994, Nature 168:32-38). X69827 is a mouse 981 bp partial 
cDNA with a potential open reading frame of 323 amino acids. 
This sequence has been shown to have similarity to the family 
of phosphodiesterase proteins (Vambutas, V. and Wolgemuth, 
D.J., 1994, Biochim. Biophys. Acta. 1212:203-206). 

The above sequence was flankecJ by consensus splice 
sites, further demonstrating that the sequence is from an 
exon, or a coding region, of a gene. The homology to a known 
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gene, as described above, coupled with the presence of 
consensus splice sites strongly suggested that this region of 
the ium008p004 clone corresponded to a portion of the coding 
region of a gene. Given its location within the interval in 
5 which the tub gene is located, this putative gene, which was 
designated CBT9, represented a tub gene candidate. 

8.2 ISOLATION OF CBT9 cDNA CLONES 
MATERIALS AND METHODS 

10 cDNA cloning . In order to isolate a longer cDNA of the 

CBT9 gene, the P8X1 fragment was used as a probe to screen a 
Stratagene (La Jolla, CA) mouse brain cDNA library (#936309) . 
For hybridization, Amersham Rapid Hyb Buffer (Cat. No. 
RPN1639) was utilized according to manufacturer's protocol. 

15 A final washing stringency of was 2x SSC/0,1% SDS at 65^C was 
attained and autoradiography was performed overnight. One 
million clones were screened. Among the clones identified 
was the fume009 clone, a 1.15 kb cDNA, which was then 
sequenced . 

20 The fume009 clone was used to screen a mouse 

hypothalamus cDNA library. This library was constructed from 
poly-A* RNA from 6 week old C57BL/6J mice. First and second 
strand cDNA was made from the poly-A* RNA using standard 
procedures. cDNA was ligated into Uni-ZAP XR lambda vector 

25 and packaged using a Stratagene kit (Cat. No. 237611) . 
Identical washing conditions as described above were 
utilized. The screen identified a 6.0 kb clone, designated 
fumhOlS, which was sequenced. The fumh019 cDNA clone 
contains the entire CBT9 gene coding region. The CBT9 

30 sequence is further discussed, below, in the Example 
presented in Section 12. 

RESULTS 

In order to isolate CBT9 cDNA. clone, the PBXl fragment 
35 was used, as described, above, in the Materials and Methods 
portion of this section, to screen a mouse brain cDNA 
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library. This screen resulted in the isolation of the 
fuineOOS 1.15 kb cDNA clone. 

The fume009 cDNA clone was then used, as described, 
above, in the Materials and Methods portion of this section, 
5 to screen a mouse hypothalamus cDNA library. This screen 
resulted in the isolation of a 6.0 kb cDNA clone, designated 
fumh019. 

The fumh019 cDNA clone was sequenced and was determined 
to contain the entire CBT9 coding region. The CBT9 
10 nucleotide and amino acid sequence are described, below, in 
the Example presented, below, in Section 12. 

9. EXAMPLE: CHARACTERIZATION OF THE EXPRESSION OF THE 
CBT9 GENE 

In the Example presented herein. Northern analysis data 
is described which characterizes the CBT9 gene (see Section 
8, above) . Specifically, experiments are presented herein 
which evaluate the expression of CBT9 in a number of mouse 
tissues obtained from wild type and tyfe mice. The results 
2Q presented herein are consistent with the CBT9 gene being the 
tub gene. 

9.1 MATERIALS AND METHOnfi 

ygrth^rn anaJrVgig. The PSXl DNA fragment and the 
25 fume009 cDNA clone were used to probe Northern blots 
containing total mouse RNA. 

Total RNA from £iib and wild type {C57BL/6J) mice was 
isolated and utilized for the Northern analysis. All mice 
were sacrificed by carbon dioxide asphyxiation. Tissues were 
3P dissected on ice, snap- frozen in liquid nitrogen and stored 
at -80" C. Total RNA was isolated using RNazolB (TelTest, 
Inc.) The total RNA samples were resuspended in RNase-free 
DEPC- treated water and quant itated by optical density 
measurement . 

35 For the Northern blots, lO/xg total RNA of each sample 

was loaded onto a formaldehyde gel. The gel was blotted onto 
a nylon membrane using standard Northern transfer techniques. 
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The blot was hybridized with P8X1 which had been 
radiolabelled by random priming using a Gibco-BRL kit (Cat. 
No. 18187-013) according the manufacturer's recommended 
protocol. For hybridization, Amersham Rapid Hyb Buffer (Cat. 
5 No. RPN1639) was utilized according to manufacturer's 

protocol. A final washing stringency of O.lx SSC/0.1% SDS at 
es^'C was attained, and autoradiography was performed 
overnight . 

The Northern blot depicted in FIG, 2 was loaded as 

10 follows: lane 1, wild type brain without hypothalamus; lane 
2, tub brain without hypothalamus; lane 3, wild type 
hypothalamus; lane 4, tub hypothalamus; lane 5, wild type 
heart; lane 6, tub heart; lane 7, wild type lung; lane 8, tub 
lung; lane 9, wild type liver; lane 10, tub liver; lane 11, 

15 wild type kidney; lane 12, Jtufe kidney; lane 13, wild type 
spleen; lane 14, tub spleen; lane 15, wild type stomach; lane 
16, tub stomach; lane 17, wild type muscle; lane 18, tub 
muscle; lane 19, wild type fat; lane 20, £ufe fat; lane 21, 
wild type testis; lane 22, tub testis; lane 23, RNA molecular 

20 weight standards, the sizes of which are indicated by the 
lines at the right hand side of the blot. Specifically, the 
sizes are 9.49 kb, 7.46 kb, 4.40 kb, 2.37 kb, 1.35 kb and 
0.24 kb. The crosses indicate the positions of the 288 and 
18S ribosomal RNA molecules. "Wild type" refers to C57BL/6J 

25 mice. 

The Northern blot depicted in FIG. 3 was loaded as 
follows: lane 1, RNA molecular weight standards, the sizes 
of which are indicated by the lines at the side of the blot 
(specifically, such sizes are 9.49 kb, 7.46 kb, 4.40 kb, 2.37 

30 kb, 1.35 kb and 0.24 kb) ; lane 2, wild type brain without 
hypothalamus; lane 3, brain without hypothalamus; lane 4, 
wild type hypothalamus; lane 5, tub hypothalamus; lane 6, 
wild type heart; lane 7, tub heart; lane 8, wild type lung; 
lane 9, tub lung; lane 10, wild type liver; lane 11, tub 

35 liver; lane 12, wild type kidney; lane 13, tub kidney; lane 
14, wild type spleen; lane 15, tub spleen; lane 16, wild type 
stomach; lane 17, tub stomach; lane 18, wild type muscle; 
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lane 19, tub muscle; lane 20, wild type fat; lane 21, tub 
fat; lane 22, wild type testis; lane 23, tub testis. The 
crosses indicate the positions of the 28S and 18S ribosomal 
RNA molecules. "Wild type" refers to C57BL/6J mice. 

5 

9.2. RESULTS 

As shown in FIG. 2, a CBT9 transcript of approximately 
7.0 kb is present in the hypothalamus without brain (lane 2) 
and in the hypothalamus (lane 4) RNA samples derived from the 

10 wild type C57BL/6J mice as detected by the P8X1 probe. No 
CBT9 transcript is detectable in other total RNA samples 
derived from wild type mouse tissues. 

As is further shown in FIG. 2, a CBT9 transcript of 
approximately 7.5 kb, i.e. . approximately 0-5 kb larger than 

15 the transcript seen in the wild type tissues, is present in 
both the brain without hypothalamus (lane 3) and hypothalamus 
(lane 5) RNA samples derived from the tub mice as detected by 
the P8X1 probe. No CBT9 transcript is detectable in other 
samples of total RNA derived from tub mouse tissues. It 

20 should additionally be noted that the abundance of the 

transcript detected by the P8X1 probe in tub RNA samples is 
approximately 5- fold greater than it is in RNA samples from 
wild type {C57BL/6J) mice. 

In addition, the fume009 clone was used as a probe to 

25 verify the results, described above, which were obtained 
using the P8X1 fragment as a probe. As shown in FIG, 3, 
Northern analysis using such a fume009 sequence to probe 
total RNA from tub and wild type mouse tissue samples yielded 
the same CBT9 results which were observed using the P8X1 

30 probe. Specifically, a transcript of the same increased size 
was seen in the total RNA samples derived from tub homozygous " 
mice and the same up regulation was observed in the amount of 
tub RNA present in total RNA samples derived from tub 
homozygous animals relative to wild type animals. 

35 A Northern blot analysis of total RNA derived from an 

animal genotypically shown to be heterozygous for the tub 
mutation revealed, as expected from the above results, the 
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presence of both the 7.5 kb and 7.0 kb transcripts in total 
brain RNA. In addition, a moderate up regulation 
(approximately two- fold) of CBT9 transcript levels relative 
to CBT9 levels in wild type animals, was observed. 
5 The results of these Northern smalyses strongly suggest 

that a defect within the CBT9 gene results in the tubby 
phenotype. Specifically, the difference in size observed 
between the CBT9 transcript in wild type and in tijb RNA is 
consistent with a mutation resulting in the inclusion of 

10 exogenous nucleic acid into the ^yfe mRNA. Second, the 

approximately 5 -fold up regulation of CBT9 RNA levels in the 
RNA samples derived from the tub/tub homozygotes relative to 
levels observed in RNA samples derived from the wild type 
mice suggests that such high levels of this transcript are 

15 related to the obesity phenotype seen in tubby animals. This 
may be the result of a negative feedback loop induced by the 
absence or malfunction of the protein encoded by the mutant 
tab {CBT9) gene: Third, in total mouse RNA, the CBT9 gene is 
expressed in the brain, including the hypothalamus, a region 

20 of the brain which is known to be involved in the control of 
body weight (Bray, G.A. , 1992, Progress in Brain Res. 21:333- 
341) . Finally, the moderate up regulation seen in the 
heterozygous animals is consistent with the recessive 
inheritance pattern of the tubby phenotype, in which 

25 heterozygotes are not obese, but, nonetheless, have been 
shown to exhibit some phenotypic differences relative to 
homozygous wild type control animals (Nishina, P.M. et al,, 
1994, Metabolism 43:554-558) . 

30 10. EXAMPLE; CBT9 SOTr fHERN BLOT ANALYSIS 

. In the Example presented herein, the results of a 
Southern blot analysis are described which indicate that 
homologs of the murine CBT9 gene are present and have been 
conserved in other mammalian species. 
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10 



10.1. MATERIAL AND METHODS 

Southern blot analysis. Two PGR primers were designed 

from the CBT9 nucleotide coding sequence, as follows; 

P8X9F1: 5'-GGA CAA GAA GGG GAT GGA 
P8X10R1: 5'-CCG TGG ATG ATC TGG AAG T-3' 

The primers were used to amplify, via RT-PCR, a 650 bp cDNA 

fragment (designated P8X9-10) from C57BL/6J mouse whole brain 

RNA. Standard RT-PCR conditions were utilized. The band was 

gel -purified and random-prime radiolabelled, as described 

above . 

The resulting probe was hybridized to a Southern blot of 
EcoRI -digested genomic DNA (BIOS Laboratories; #EBM-100E) 
from various mammals. Each lane was loaded with' 8 fig of 
digested genomic DNA. For hybridization, Amersham Rapid Hyb 
Buffer (Cat. No. RPN1639) was utilized according to 
manufacturer's protocol. A final washing stringency of 0-5x 
SSC/0.1% SDS at SS^'C was attained, and blots were exposed 
overnight with .an intensifying screen at -80®C. 

The lanes of the Southern blot depicted in FIG. 5 were 
2Q loaded as follows: lane 1, markers: lambda DNA digested 
with Hindlll (band sizes are as indicated in the figure) ; 
lane 2, mouse; lane 3, hamster; lane 4, rat; lane 5, rabbit; 
lane 6, dog; lane 7, cat; lane 8, cow; lane 9, sheep; lane 
10, pig; lane 11, marmoset; lane 12, human. 

25 

10.2 RESyLTg 
A Southern blot analysis was conducted using a CBT9 
probe (P8X9-10; see Section 10.1 for details) and a DNA blot 
containing EcoRI -digested mammalian genomic DNA of various 
species, as described above, in Section 10.1. As is shown in 
FIG. 5, the CBT9 probe detects homologous sequences in each 
of the mammalian DNA sample represented on the blot. This 
result provides additional evidence that the CBT9 sequence 
used as a probe is part of a gene and, additionally, 
demonstrates that the sequences show a high level of 
conservation among a wide range of mammalian species. 
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EXAMPLE : CBT9 IN S ITU HYBRIDIZATTOW awi^ T.vgTC 

In the Example presented herein, the results of an iq 
situ hybridization analysis are described which verify that 
the CBT9 gene is expressed in the brain. Primary CBT9 gene 
5 expression occurred within the hippocampus, hypothalamus and 
cortex. Weaker hybridization could be seen throughout the 
brain. 



11-1 MATERIALS AND METHnn.c! 

10 In situ Hybridization Localization: Brains from 6 

month-old C57 BL/6J mice were removed flash frozen at -SCC 
and stored at -SO^C until use. lO fim frozen sections of 
brains were post-fixed with 4% PFA/PBS for 15 minutes. After 
washing with PBS, sections were digested with 1 ng/ml 

15 proteinase K at 37»C for 15 minutes, and again incubated with 
4% PFA/PBS for 10 minutes. Sections were then washed with 
PBS, incubated with 0.2 N HCl for 10 minutes, washed with 
PBS, incubated with 0.25% acetic anhydride/1 M 
triethanolamine for 10 minutes, washed- with PBS and 

20 dehydrated with 70% ethanol and 100% ethanol. Hybridizations 
were performed with "S-radiolabelled (5x10' cpm/ml) cRNA 
probes encoding a 1.15 kb segment of the coding region of the 
mouse clone fume009 in the presence of 50% formamide, 10% 
dextran sulfate, Ix Denhardt's solution, 600 mM NaCl, 10 mM 

25 DTT, 0.25% SDS and 100 /zg/ml tRNA for 18 hours at 55»C. 

After hybridization, slides were washed with 5x SSC at 55»C, 
50% formamide/2x SSC at 55«'C for 30 minutes, 10 mM Tris- 
HCKpH 7.6) /500 mM NaCl/l mM EDTA (TNE) at 37»C for 10 
minutes, incubated in 10 ng/ml RNase A in TNE at 37«»C for 30 

30 minutes, washed in TNE at 37«»C for 10 minutes, incubated once 
in 2x SSC at 50«»C for 30 minutes, twice in 0.2x SSC at 50«>C 
for 30 minutes, and dehydrated with 70% ethanol and 100% 
ethanol , Localization of mRNA transcripts was detected by 
film emulsion autoradiography followed by dipping slides in 

35 photo-emulsion for precise autoradiographic localization. 
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11.2 RESULTS 

The fume009 cDNA clone was used as a probe for an in 
situ hybridization analysis. Specifically, the 1.15 kb 
fume009 probe was hybridized to sections of wild type 
5 C57BL/6J mice. As shown in FIG. 5, the CBT9 transcript is 
. expressed in the hypothalamus and other regions of the brain, 
consistent with the above-described Northern analysis data, 
which was presented in Section 9, above. 

Specifically, an mRNA transcript hybridizing to the 1.15 
. 10 kB fume009 antisense cRNA probe was localized to discrete 

regions of the brain of both C57BL/6J wild-type mice (FIG. 5) 
and tub homozygous mice. Signal was observed in the 
hypothalamus adjacent to the 3rd ventricle in two "nuclear 
bodies" {indicated by dense clustering of nuclei) as well as 
15 at the base of the hypothalamus adjacent to the optic chiasm 
in the tissue from both mice. Thus, expression in the 
hypothalamus is highest in the paraventricular, ventromedial 
and arcuate nuclei. 

In addition, signal was detected in scattered cells in 
20 the subcortical temporal lobe and in hippocampus in the 

tissue sections from both mice. FIG. 5 shows the regions of 
localization of tub gene transcript in the brain of C57BL/6J 
mice {the arrows indicate those regions where signal was 
detected) . Weaker hybridization was observed throughout the 
25 brain. No distinct signal was observed in heart, spleen, 
liver, lung, skeletal muscle, pancreas, small intestine and 
stomach of either the C57 BL/6J wild- type mice or tub 
homozygous mice, 

30 12. EI^W^PliE; IDENTIFICATION OF CBT9 AS THE l^ub CVKP. 

Presented in this Example is, first, a mutational 
analysis of the CBT9 gene, which compares CBT9 gene sequences 
within nucleic acid derived from wild type and tub animals. 
Specifically, a CBT9 splice site mutation is identified 

35 within tub genomic DNA which is absent from wild type genomic 
DNA. Second, the nucleotide and derived amino acid sequence 
of the CBT9 gene is presented. The results disclosed herein, 

- 80 - 



wo 97/02048 



PCTAJS96/11186 



coupled with the results presented, above, in Sections 6 to 
11, identify the CBT9 gene to be the tufe gene, 

12.1 MATERIALS AND METHODfi 

5 PGR analysis. A number of primers were designed to 

amplify the entire open reading frame of CBT9 from tub and 

wild type mice in order to identify the location of the 

mutation in the tub gene. The following two primers 

amplified different sized cDNA fragments when amplifying tub - 

10 derived versus wild type-derived nucleic acid: 

PXIR: 5'-TGA GAC AAA TGT CCT AGG CT-3' 

(corresponding to CBT9 base pair 1113 to 
1132); 

PX12R: 5'-TGG ACA GAG CAA TGG CGA AG- 3' 

(corresponding to CBT9 base pair 1489 to 1470) 
Standard PGR conditions and sequencing procedures were 
utilized. 



12.2. RESULTS 
12.2.1. GBT9 MUTATIONAL ANALYSIS 

20 order to more definitively show that the CBT9 gene 

corresponded to the tub gene, a PGR study was conducted to 
define the mutation causing the GBT9 transcript size change 
observed in £ijb mice relative to the CBT9 transcript size 
observed in RNA of wild type mice. Of the PGR primer pairs 

25 utilized, only one resulted in a size differential between 
the fragment amplified using Jtub-derived nucleic acid and the 
fragment amplified using wild type -derived nucleic acid (see 
Section 10. 1 for details). 

Specifically, utilizing this primer pair ( i.e. . pxiR and 
PX12R) a CDNA was amplified from wild type (C57BL/6J) brain 
RNA which was about 350 bp/ while a cDNA fragment was 
amplified from tub brain RNA which was about 8 00 bp. The 
amplification of both wild type (G57BL/6J) and tub genomic 
DNA resulted in a band of approximately 90 0 bp. 

35 It should be noted that the size differential, 

approximately 450 bp, between the tub and wild type cDNA 
amplified fragments roughly corresponds to the difference in 
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transcript size (i.e. . 7 vs. 7.5 kb) observed between tub and 
wild type RNA in the CBT9 Northern analysis described, above, 
in the Example presented in Section 9. By sequencing (see 
below) it was determined that the precise size difference 
5 between the tub and wild type cDNA amplified sequences was 
398 bp. 

The 90 0 bp fragment amplified from genomic DNA reveals 
the presence of a second intron within the amplified region. 
Only one of these introns (of approximately 100 bp in length) 
10 was processed correctly in the tub animals, as discussed 
below. 

The cDNA and genomic amplified fragments in the region 
of the mutation were sequenced and the wild type- and tub - 
derived sequences were aligned, as shown in FIG. 7. For 

15 orientation of the genomic sequence depicted in FIG. 7 with 
the full length CBT9 cDNA coding sequence shown in FIG. 6, 
bases 1-12 and 411-437 in FIG. 7 correspond to bases 1373- 
1384 and 1385-1411 of FIG. 6, In FIG. 7, the two top 
sequences are from genomic DNA derived from tub and wild type 

20 C57BL/6J mice, as indicated. The bottom two sequences are 
derived from cDNA from tub and wild type mice, as indicated. 
The vertical arrow shows the position of the tub mutation. 
The horizontal box indicates the consensus splice site 
sequence in C57BL/6J which is abolished in the tub genomic 

25 DNA. The asterisks indicate the intron which is erroneously 
not spliced out of the mature tub mRNA. 

The portion of the CBT9 gene sequence in FIG. 7 depicts 
only the genomic region near the mutation site. This 
alignment revealed a single base pair difference of a G to T 

30 transversion in the first base of the splice site (GTGACT; 
see boxed region of FIG. 1) of the intron between base 1384 
and 1385 of the open reading frame of the genomic DNA. This 
mutation abolishes the splice site, resulting in retention of 
an intron of approximately 450 bp dn the amplified cDNA 

35 derived from the tub RNA. To confirm that the identified 
sequence change did not simply represent a polymorphism, the 
splice site was sequenced in 32 additional mouse strains. In 
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each of the strains, the DNA sequence at the putative 
mutation site was identical to that observed in the wild type 
C57BL/6J strain. 

FIG. 8. depicts a schematic representation of the 
5 splicing defect within the CBT9 in tyb mice. The top half of 
the figure shows the normal, wild type splicing of the intron 
from C57BL/6J RNA and the predicted carboxy terminus of the 
wild type CBT9 protein. The G to T mutation of the first 
base of the intron within the CBT9 gene in tub mice abolishes 

10 splicing of this intron, causing the intron to be retained 
within the mature mRNA. The predicted tub mutant CBT9 
protein, therefore, is abnormal. Specifically, due to 
translation of intronic sequence, this mutant tub gene 
product lacks the final 44 amino acid residues of the normal 

15 CBT9 protein and, instead, contains 24 intron-encoded amino 
acid residues at its carboxy terminus which are erroneously 
added to the tub protein until a stop codon within the 
intronic sequence is reached. 

20 12.2.2. CBT9 NUCLEOTIDE AND A MINO ACID 5EQUENOT 

As discussed in Section 8.2, above, the fumh019 CBT9 
cDNA clone was sequenced. Sequencing revealed that the 
fumh019 cDNA clone contained the entire CBT9 open reading 
frame . 

25 The nucleotide sequence and amino acid sequence of CBT9 

is shown in FIG. 6. The CBT9-encoded protein is 505 amino 
acid residues in length. CBT9 is a novel gene, with no 
identical sequences present in published databases. The 
entire CBT9 coding region of the Mus spretus and A/J mouse 

30 strains were sequenced and no non- conservative amino acid 
changes in either strain as compared to the C57BL/6J tub 
sequence were found. 

The CBT9 gene product is a hydrophilic protein, with an 
estimated pi of 9.2, which lacks any obvious secretary 

35 sequence, mitochondrial transit peptide or transmembrane 

domain. The gene product contains a region consisting of two 
runs of serine amino acid residues separated by eight acidic 
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amino acid residues (amino acid residues 191-211) , which 
could serve as a hinge between domains of the protein. In 
addition, two potential dibasic protease cleavage sites are 
present at amino acid positions 302 and 383, and two 
5 potential glycosylation sites are present at amino acid 
positions 205 and 426. 

The carboxy half of the CBT9 gene product shows 
similarity to several sequences in the pviblic protein 
databases and/or encoded by sequences present in public 

10 nucleotide databases, including p4-6, a mouse testis cDNA 
(Genbank X69827) ; F10B5.4 (Genbank Z48334) , a C. elegans 
genomic sequence; DM87D3S (Genbank Z50688) a Drosophila STS; 
and ys86c04.rl (Genbank H92408) , a human retinal cDNA; as 
well as several rice, maize and Arabidopsis ESTs. With the 

15 exception of the mouse testis cDNA p4-6, none of these 
sequences has been functionally characterized. p4-6 was 
isolated by screening of a cDNA library with a rat 
phosphodiesterase probe (Vambutas, V. & Wolgemuth, D.J. 1994, 
Biochim. Biophys. Acta 1217 : 203-206). 

20 Upon alignment of the CBT9 gene product and the 

sequences showing similarity to CBT9, certain regions were 
shown to be completely conserved. Specifically, the two 
dibasic protease cleavage amino acid residues and the 
cysteine amino acid at the penultimate CBT9 position are all 

25 completely conserved among all the CBT9-related sequences. 

The data presented in Sections 6 to 11 above, including 
mapping data, and Northern and in situ analyses, and the 
mutational analysis data presented in this Section 
demonstrating that the tubby phenotype is associated with a 

30 splicing defect within the CBT9 gene which results in a major 
alteration of the carboxy terminus of the CST9 gene product, 
represent conclusive evidence that the CBT9 gene is the tub 
gene. Specifically, CBT9 maps within the 0.25cM interval 
that the tjib gene has been shown, herein, to map. Further, 

35 the CBT9 gene is expressed in the brain, including the 
hypothalamus, a region known to be involved in body weight 
control. Additionally, the CBT9 transcript in £ufe animals is 
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larger than the CBT9 transcript found in wild type C57BL/6J 
animals and it has been shown herein that this increase in 
size is due to a single base mutation in a CBT9 splice site 
which results in the incorrect splicing of the RNA such that 
5 a 398 nucleotide intron remains within the mature mRNA. As 
a result, the protein which is translated from such a mutant 
transcript exhibits an abnormal carboxy terminus. Presumably 
as a result of this defect, the CBT9 mRNA is upregulated 
approximately 5- fold in homozygous tub / tub mice. The 
10 heterozygous tub/+ mice showed a more modest upregulation, as 
would be expected, given the heterozygous tub phenotype. 
In summary, therefore, the CBT9 gene has successfully been 
identified to be the tub gene. 

15 13. EXAMPLE: CLONING AND CHARACTERIZATION OF THE 

HUMAN tub GENE 

The Example presented herein describes the successful 

cloning and characterization of the human tub gene, which is 

involved in the control of human body weight . Both the human 

20 £li^ gene and gene product exhibit a striking level of 

similarity to the murine tub gene and gene product. 

13.1. MATERIALS AND METHQD5 

P8X5-1 tub probe generation: The 950 base pair P8X5-1 

25 gene cDNA probe was generated by standard PGR 

amplification of the murine cDNA clone fumh019, described, 

above, in Section 8. The following primers were utilized for 

the amplification: 

P8X5R1: 5'-CCG ACT CGA TTG CCA GTG TA -3' 
P8X1F1: 5'-GCG GAT ACA GAC TCT CTC AT -3' 

30 

Upon amplification, the probe was gel purified and 
radiolabelled according to standard protocols. 

cDNA screening: Screening was performed on a human 
fetal brain cDNA library (Clontech #HL1149x) . Hybridization 
was performed for 4 hours at 65** C 'using Amersham Rapid Hyb 
buffer (Cat. # RPN1639) according to the manufacturer's 
protocol. A final washing stringency of l.OxSSC/ 0.1% SDS at 
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50^ C for 20 minutes was achieved. Autoradiography was 
performed overnight. 

DNA sequencing; Standard DNA sequencing techniques we 
utilized for the sequencing of the resulting putative human 
5 tub cDNA clones. 



13.2 RESULTS 
The 950 base pair P8X5-1 murine tub gene probe, 
described, above, in Section 13.1, was used to screen a human 
10 fetal brain cDNA library for clones corresponding to the 



independent positive clones. Among these clones were those 

15 designated CBT9H1, CBT9H2 and CBT9H3, which have been 

deposited with the ATCC. Sequencing revealed that the entire 
coding region of the human £ub gene was contained within 
these partially overlapping clones. 

The nucleotide and derived amino acid sequences of the 

20 human tub gene are shown in FIG. 9. As shown in FIG. 9, the 
human tub gene encodes a 506 amino acid protein. The human 
tub gene product encodes a hydrophilic protein exhibiting an 
estimated pi of 9.2 which lacks any obvious secretory 
sequence, mitochondrial transit peptide or transmembrane 

25 domain. The gene product contains a region consisting of two 
runs of serine amino acid residues separated by a acidic 
amino acid residues (amino acid 191-211) which could serve as 
a hinge between domains of the protein. In addition, there 
are two potential dibasic protease cleavage sites at amino 

30 acid positions 301-306 and 381-384, as well as two potential 
N-glycosylation sites at amino acid residues 206 and 427. 

The human tub gene and gene product exhibit a striking 
similarity to the murine tub gene and gene product. 
Specifically, the human Jtufe gene is 89% identical, at the 

35 nucleotide level, to the murine tub gene. Further, the 506 
amino acid human tub gene product exhibits a 94% identity to 
the 505 amino acid murine tub gene product. Amino acid 
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residue 201 represents the only amino acid insertion between, 
the two ^ gene product sequences. Specifically, the human 
tub amino acid residue 201 corresponds to an insertion 
between murine amino acid residues 200 and 201. The carboxy 
5 half of the tub gene product is particularly highly 

conserved. The final 260 amino acid residues of the human 
and mouse tub gene products differ by only a single residue. 
Specifically, murine tub gene product amino acid residue 399 
is a cysteine, while the corresponding human tub gene product 
10 amino acid residue 400 is serine. 

In summary, the results presented herein represent the 
successful cloning of the human tub gene. 

14. EXAMPLE: HUMAN AND MURINE tub GENE 
15 ALTERNATIVE SPLICTNfl 

The Example presented herein describes the discovery 
that both the human and murine tub genes produce 
alternatively spliced transcripts. Specifically, it is shown 
that tub transcripts are produced which either contain or are 
20 lacking the sequence corresponding to tub exon 5. 

Quantitative variation between the relative amounts of 
alternatively spliced species produced is also described. 

14.1. MATERIAL AND METHODS 
25 RT-PCR. First strand cDNA was synthesized from total 

RNA using Superscript (Gibco-BRL) according to supplier's 
protocol. Subsequent PGR conditions were as follows: Hot 
start with 0.5U AmpliTaq, followed by 30 cycles at 94° C for 1 
minute, 55° C for 1 minute and 72'' C for 1 minute. Products 
30 were electrophoresed on 2% agarose gels. RT-PCR products to 
be sequenced were run on LMP agarose, excised, digested with 
^-Agarase (New England Biolabs) , precipitated and resuspended 
in water. The same conditions were utilized for 
amplification of both human and mouse RNA populations. 
35 The primers utilized for mouse sequence amplification 

were derived from murine tub exons 4 and 6: P8X5R: 5'- CCG 
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ACT CGA TTG CCA GTG TA -3'; and CBT9R5 : 5'- GGA GCT GTT TTC 
ATC CTC ATC -3' . 

The primers utilized for human sequence amplification 
were derived from human t^xb exons 4 and 6: hCBTSFll; 5'- gaa 
5 GGA GAA GAA GGG AAA GC-3' ; and hCBTSRll : 5'- GGG TGT TAC TAT 
TTA GCT GG-3' . 

Other techniques. All other techniques were performed 
according to standard procedures and/or as described in the 
Examples presented above. Primers used for genomic PCR 
10 amplification were derived from %ub exons 4 and 5: X4F1: 5'- 
TTC AAG AGG CCG ACT CGA TT -3'/ and X5R1 : 5' - TTC CTC TGC 
ATC GTG GCA C -3' , 

14.2. RESULTS 

15 RT-PCR from mouse brain RNA using primers derived from 

exons 4 and 6, as described in Section 14.1, above, resulted 
in the amplification of two products. Sequencing of these 
products showed that they differ by the presence or absence 
of sequence corresponding to exon 5. RT-PCR of RNA from 

20 C57BL/6J mice consistently yielded more of the amplified 

product containing exon 5. This result was shown to be true 
for 7 other strains tested. 

RT-PCR performed using brain RNA derived from the Mus 
spretus strain, however, invariably showed a greater 

25 abundance of the product lacking exon 5. This was 
demonstrated in 6 independent M. spretus mice. This 
quantitative pattern was also found to be exhibited in M. 
castaneous mice. Genomic PCR revealed that the intron 
preceding exon 5 was 0.5 kb shorter in both M. spretus and M. 

30 castaneous strains. Sequencing of a portion of this intron 
showed that its donor, acceptor and branch point sequences 
were not affected by the sequence missing in these strains. 

RT-PCR of total human RNA from several tissues was 
performed with two primers from exons 4 and 6 using the same 

35 conditions as for mouse RNA. The amplification primers were 
hCBT9Fll and hCBT9Rll, as described, above, in Section 14.1. 
Amplification produced two amplified fragments of 281 bp and 
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113 bp. Sequencing revealed that the larger band represented 
a transcript containing exon 5, while the smaller fragment 
was missing the sequence corresponding to exon 5. Thus the 
human tub gene, also exhibits alternate splicing of exon 5 
5 Both the human and mouse exon 5 is 168 base pairs long 
. Because this length is a multiple of 3, the reading frame of 
the transcripts lacking exon 5 is conserved. 

It is possible that variant splicing may result in 
proteins with qualitatively or quantitatively distinct 
. 10 activities. The differential regulation of alternate 
splicing may result in individuals with differential 
susceptibilities to obesity. For example, in place of the 
constitutive obesity associated with the £ub mutation, 
alleles which yield a higher amount of protein encoded by 
15 transcripts lacking exon 5 relative to the level encoded by 
transcripts containing exon 5 may confer a greater 
susceptibility to obesity only in the context of a particular 
environmental and genetic background. 

20 15. EXAMPLE: .R ECOMBINANT EXPRESStow qf ttih riKMP ^T>nr.r-.r^c. 
The Example presented in this Section describes the 
recombinant expression of murine and human tub gene products. 

15.1. MATERIAT. S AND MRTHnnc 
25 Bacteri al exp rpssinn 

m^Xne tub subcloning . cDNA sequence containing the 
entire murine imfe coding region was subcloned into bacterial 
expression vector pET29*. pET29* is a modified pET29a vector 
(Novagen, Inc., Madison WI) containing an altered Shine- 
30 Dalgarno sequence for optimal initiation of translation 
(Chen, H. et al., 1994, Nuc. Acids Res. 22:4953-4957). 

In order to subclone the iyb coding sequence into the 
PET29* vector, site directed mutagenesis was performed on an 
existing tub cDNA to create a £ub sequence with appropriate 
35 restriction sites. Specifically, single stranded DNA was 
rescued from CJ 236 E. coli transformed with pMal-c2 (New 
England Biolabs, Beverley MA) plasmid containing a PCR- 
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derived cDNA by infection with K07 M13 helper phage. The 
single stranded DNA was used as a template for site directed 
mutagenesis which yielded amplified £ufe fragments containing 
altered ends (Kunkel, T.A., 1985, Proc^ Natl. Acad, Sci. USA 
5 81:488-491) . The 5' end of the amplified fragment was 
• altered such that the tub initiation codon was contained 
within an Ndel site ( i.e. . CATATG) , while the 3' end was 
altered such that part of the tub termination codon was 
contained within an EcoRI site ( i.e. . TGAATTC) . The 

.10 resulting tub cDNA was excised as a 5' Ndel to 3' EcoRI 

fragment and ligated into Ndel /EcoRI -digested pET29* vector, 
to yield the murine pET29*-tub expression construct. 

In order to produce the murine tub -HISe expression 
construct, codons for six histidine residues were fused in- 

15 frame at the 3' end of the £ufe cDNA sequence. Site directed 
mutagenesis was employed as described above, except that the 
primers utilized yielded fragments containing the six 
histidine codons inserted just 5' of the EcoRI site at the 3' 
end of the cDNA (i.e. . CACCACCACCACCACGACTGAATTC) . The 

20 resulting mutagenized fragment was excised and ligated into 
pET29* as described above to yield the murine pET29*-tub-HIS^ 
expression construct. 

Human tub subc lonina . The entire coding region of the 
human tub sequence was also inserted into the pET29* 

25 expression vector in both native and Hl-Sg fusion forms. For 
insertion into pET29*, a human tub cDNA in pMal-C2 was 
modified via site directed mutagenesis to create 5' Ndel and 
3' EcoRI restriction sites, as described for the murine tub 
sequence, above, to yield the human pET29*- tub expression 

30 construct. 

For construction of the human tub HISg construct, six 
histidine codons were introduced just 5' of the EcoRI site by 
a three part ligation. Specifically, a 5' ApaLI - 3' EcoRI 
restriction fragment encoding the last 25 amino acids of the 
35 murine pET29*-tub-HIS6 v/as exchanged for the equivalent 

fragment of the human tub gene sequence in human pET29*-tub 
construct, to yield the human pET29*-tub-HIS^ expression 
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construct. It should be noted that, although the human and 
mouse tub genes have differing primary sequences, the amino 
acid residues they encode within this carboxy- terminal region 
are identical. 

5 Expression of recombinan t tub proteins . Host bacteria 

BL21(DE3) (Novagen, Inc., Madison WI) were chemically 
transformed with each of the expression constructs described 
above (i_^, murine pET29*-£ijfe, murine pET29*-tub-HIS^ , human 
pET29*-tub or human pET29*-£iib-HISfi) and grown in 6 liters BHI 

10 (Brain Heart Infusion broth) cultures to mid- log phase 
(OD595=l.o) at 37° C. 

T7 RNA polymerase and, concomitantly, tub protein, 
expression, was induced by the addition of IPTG to a final 
concentration of 0.5 mM. Two hours post -induction, bacteria 

15 were collected by centrifugation and frozen. 

Mammalian expression . 

Murine tub subclonina. To prepare murine tub cDNA 
containing the entire tub coding region, the 5' end of the 

20 murine tub cDNA in the murine pET29*-£ub construct was 

mutagenized to remove the Ndel restriction site, and replaced 
with a BamHI restriction site and a Kozak box (Kozak, M. 
1987, Nuc. Acid Res. 15:8125-8132) for efficient initiation 
of translation in mammalian cells. After modification, the 

25 sequence just 5' of the tub start codon was as follows: 
GGATCCACCATG (the Start codon is underlined) . 

The modified sequence was digested with BamHI and EcoRI 
to excise the region to be subcloned. After excision, the 
murine tub cDNA was ligated into the transient expression 

30 vector pNSc (to yield the pN8 6- tub construct) and into the 
stable retroviral expression vector pWZLblast (to yield the 
pWZLblast-iiib construct) . Transcription in pNBc is directed 
from the human. CMV immediate early promoter, while 
transcription from pWZLblast is initiated in the promoter 

35 embedded Moloney Leukemia Virus LTR. 

Constructs for the expression of epitope tagged 
recombinant tub gene product were generated, in which a DNA 
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fragment encoding three tandem copies of the influenza virus 
hemagglutinin peptide YPYDVPDYA was fused in- frame with the 
NH2 terminus of the tub cDNA in pN8e-tub. Specifically, the 
triple flu epitope was amplified from the plasmid pBS HA^ via 
5 PGR with primers possessing 5' Hindlll and 3' BamHI 
restriction sites. The PGR product was purified, 
Hindlll/BamHI digested and ligated into Hindlll/BamHI 
digested pN8e-tub. The correct sequence of the fusion 
construct (designated pN8€3Xf lu-tub) was verified. 

10 Express ion of recombinant tub proteins . Transient 

expression is achieved by transfection of pNSe-tub or 
pNBeSXf lu-tub expression constructs into 293 EBNA cells 
(Invitrogen Corp.) via lipofection (Lipof ectamine; Life 
Technologies Corp.). Analyses performed 72 hours post- 

15 lipofection. 

Stably infected polyclonal pools of NIH 3T3 cells 
harboring pWLZblast-tiib proviruses are generated by 
transiently transfecting QE producer cells (Morgenstern, J. P. 
& Land, H. , 1990, Nuc, Acids Res. 18:3587-3596) with calcium 

20 phosphate and harvesting recombinant retrovirus 48 hours 
later. The virus is then used to infect target NIH 3T3 
fibroblasts overnight at which time the infected cells are 
split 1:10 into medium supplemented with blasticidin HCl (ICN 
Corp.) at a concentration of 10 ptg/ml . Colonies of 

25 blasticidinS HCl-resistant cells which appear within roughly 
two weeks are pooled and lysed for analysis. 

15.2. RESULTS 
Described herein is the successful expression of 
30 recombinant murine and human tub gene products in mammalian 
and/or bacterial systems. With respect to bacterial 
expression, both native and HISg fusion ( i.e. , a fusion 
protein containing six carboxy- terminal histidine amino acid 
residues following the native tub amino acid sequence) tub 
35 gene products have been expressed. Details regarding the 
creation of tub expression constructs and production of gene 
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products using these constructs are described, above, in 
Section 15.1, 

Aliquots of bacterial lysates (representing 
approximately lO"^ of the total 6 liter preparation were 
5 analyzed using standard SDS polyacryl amide gel 

electrophoresis, as depicted in FIG. 11. A protein with a 
molecular weight of approximately 57 JcD was readily apparent 
in proteins obtained from induced bacteria containing murine 
pET29*-tub. 57 kD was the approximate molecular weight one 

10 would predict for the murine tub protein, with its 505 amino 
acid residues. Likewise, a protein with a molecular weight 
of approximately 57 kD was readily apparent in proteins 
obtained from induced bacteria containing human pET2 9*-tub. 
57 kD was the approximate molecular weight one would predict 

15 for the 506 amino acid residue human tub gene product. 

A protein exhibiting a slightly increased molecular 
weight was readily apparent in proteins obtained from induced 
bacteria containing either human or murine pET29*-HISfi. The 
slight increase in molecular weight was expected given the 

20 additional six histidine residues present in these tub ^HIS^ 
fusion proteins. 

Constructs for the expression of epitope-tagged murine 
tub protein were utilized to demonstrate successful mammalian 
expression of recombinant tub gene product. Specifically, 

25 the pN8e3Xf lu-tub expression construct was introduced, via 
lipofection, into 293 EBNA cells, as described, above, in 
Section 15.1. After lipofection, immunoprecipitation 
followed by Western blot detection with the monoclonal 
antibody 12CA5 (directed against the flu hemagglutinin 

30 peptide; Boehringer Mannhein Corp.) was performed. Western 
blotting revealed the presence of a protein exhibiting a 
molecular weight of approximately 59 kD ( i . a size 
expected of the full length tub gene product fused the triple 
flu hemagglutinin peptide sequence.) . No such protein was 

35 detected in control transf ections with non-hemagglutinin 
tagged pNSe- tub constructs. 
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In summary, the results described herein indicate that 
recombinant murine and human tub gene products have 
successfully been expressed in bacterial and/or mammalian 
systems. 

5 

16. EXAMPLE: IDENTIFICATION AND CHARACTERIZATION OF A 
tub GENE HOMOLOG 

The Example presented in this Section describes the 
identification and characterization of a human tub gene 
homolog, referred to herein as human tub homolog 1. 

The mouse tub gene nucleotide sequence was utilized as a 
database query using the Blastx program (1993, Nature 
Genetics 3:266-272), which resulted in the identification of 
a human EST (GenBank Accession No. H92408) which exhibited a 
75.3% identity over 85 derived amino acid residues. The EST 
was originally derived from a human retinal library (Soares, 
B. and Benaldo, F. ) . 

Upon identification of the EST, the corresponding clone 
was obtained from Genbank and sequenced. A number of errors 

2Q in the sequence listed in the database were found. These 
included a deletion of bp 33 from the Genbank sequence, 
incorrect base pair insertions (Genbank sequence bps 330, 
339, 359, 366, 375 and 384), incorrect sequence at bps 133- 
137 (ACCGA in Genbank sequence, as opposed to the correct 

25 GGCCG sequence) and incorrect bp 398 (T in Genbank as opposed 
to the correct G) • 

The identified sequence was used to screen a retinal 
cDNA library, which resulted in the identification of several 
positive clones. FIG. 12 depicts nucleotide sequence of the 

30 homolog identified via this screening effort, which is 
referred to herein as human tub homolog 1. The sequence 
depicted in FIG. 12 codes for a substantial portion of the 
human tub homolog 1 protein, the derived amino acid sequence 
of which is also depicted in FIG. 12. The sequence exhibits 

2g a 73.9% identity over 216 derived amino acid residues. 

The EST derived from the human tub homolog 1 gene was 
mapped in the human by PCR typing of the Genebridge (G4) 
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radiation hybridization panel. Typing of the DNA and 
comparison to radiation hybrid map data at the Whitehead 
Institute Center for Genome Research (WICGR) tightly linked 
the EST to an anonymous STS, WI-4186, on human chromosome 6. 
5 Additionally, the EST was genetically mapped in the 

mouse using a C57BL/6J x Mus spretus interspecific backcross. 
Genotyping of 100 meioses demonstrated linkage to a region on 
mouse chromosome 17 between D17Mit4 8 and D7Mit 9. 

Human multiple tissue northern blots (Cat. No. 7766-1 

10 and 7760-1/ Clonetech, Palo Alto CA) containing 2Mg of poly 
A+ RNA per lane were probed with the approximately 1,3 5 kb 
EcoRI - Not I fragment of the sequence obtained from Genbank 
containing the human tub homolog 1 insert. The filters were 
prehybridized in 5 mis of Church buffer at 65** C for 1 hour, 

15 after which 100 ng of "P- labelled probe was added. Probe was 
made using Stratagene Prime-It kit (Cat. No. 300392; 
Stratagene, La Jolla CA) . Hybridization was allowed to 
proceed at 65° C for approximately 18 hours. Final washes of 
the filters was in 0.1% SDS, 0,2xSSC solution for 65* C. 

20 Washed filters were exposed to a phosphoimager for 4 hours. 
The Northern analysis was performed using a 1.35 kb 
probe as described in Section 16.1, above, containing human 
tub homolog 1 sequence encoding 285 amino acids plus 3'- 
untranslated sequence to the poly-A sequence was performed. 

25 Tissues tested included brain, lung, liver, kidney, spleen, 
thymus, muscle, prostate, testis and fat. A message of 
approximately 2 kb was apparent in the lanes containing RNA 
from skeletal muscle and testis. 

30 17. DEPQ^JT QF MiECRQORgANISMS 

The following microorganisms were deposited with the 
American Type Culture Collection (ATCC) , Rockville, Maryland, 
on the dates indicated and were assigned the indicated 
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accession 


numbers : 








Microorqanism 


Clone 


ATCC Access. No. 




H019 (E. 




fun\h019 


69856 




5 E/P8 (E. 


coli) 


P8 


69858 


•Ti mA in *iQoc 


E/P6 (E. 


coli) 


P6 


69857 


June 30, 1995 


E/B13 (E. 


coli) 


B13 


69859 


June 30, 1995 


CBT9H1 (E. 


coli) 


CBTgHl 


97222 


July 10, 1995 


CBT9H2 (E. 


coli) 


CBT9H2 


97221 


July 10, 1995 


10 CBT9H3 (E. 


coli) 


CBT9H3 


69874 


July 28, 1995 



The present invention is not to be limited in scope by 
the specific embodiments described herein, which are intended 
as single illustrations of individual aspects of the 

15 invention, and functionally equivalent methods and components 
are within the scope of the invention. Indeed, various 
modifications of the invention, in addition to those shown 
and described herein will become apparent to those skilled in 
the art from the foregoing description and accompanying 

-.J drawings. Such modifications are intended to fall within the 
scope of the appended claims. 



25 



30 



35 
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International Application No; PCT/ / 



MICROORGANISMS 




optional Sheet tn connection with the microorganism referred to on page 96. lin 


es 1-2S of the description ' 


A. IDENTIFICATION OF DEPOSIT ' 




Further deposits are Identified on an additional sheet ' 




Name of depositary institution * 




American Type Culture Collection 





Address of depositary institution (including postal code and country) • 

12301 Parklawn Drive 
RockvUle, MD 20852 
US 



Date of deposit ' June 29. 1995 Accession Number • 69856 



B. ADDITIONAL INDICATIONS ■ (l»ve bUnk if •» viAiabk), Thi, hrformaiion 



IS Gomiiiued on a iqmie anactKd iheet 



C, DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ' , 



D. SEPARATE FURNISHING OF INDICATIONS * (tow biaiik if not ippiiciblc) 



The Intfitatlont Itnad Iwlow will be submined to the Iniernaiional Bureau later • (Specify the general nature of the indications e.a. 
"Accession Number of Oeposit'l 



E. This sheet was received with die International application when fded (to be checked by the receiving Office) 

iM'j/A -^ 



(Authorized Officer) 
□ The date of receipt (from the applicant) by the Iniemadonal Bureau " 



was 



(Authorized Officer) 

Porm PCT/ROyiW iJanuary ISftil 
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International Application No: PCT/ / 

Form PCT/RO/134 (cont.) 

American Type Culture Collection 

12301 Parklawn Drive 
Rockvitle, MD 20852 



US 



Accession No. 



Date of Deposit 
June 30, 1995 



69858 



69857 



June 30, 1995 



69859 



June 30. 1995 



97222 



July 10, 1995 
July 10, 1995 
July 28, 1995 



97221 



69874 
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WHAT IS CLAIMED Tfi . 

1. An isolated nucleic acid molecule which 
encodes a body weight modulating tub protein, having a 
nucleotide sequence that: 

5 (a) encodes a polypeptide having the amino acid 

sequence shown in FIG. 6; 

(b) encodes a polypeptide having the amino acid 
sequence shown in FIG. 9; 

(c) encodes a protein containing the amino acia 
sequence encoded by the cDNA contained in cDNA 
clone fumhOlS, CBT9H1 or CBT9H3 , as deposited 
with the ATCC having accession No. 69856, 
97222 or 69874, respectively, or the genomic 
DNA contained in clone P6, PS, or B13, as 

^5 deposited with the ATCC having accession No. 

69857, 69858, or 69859 respectively; or 

(d) hybridizes under stringent conditions to the 
complement of the nucleotide sequence of (a) , 
(b) or (c) . 

20 

2. The isolated nucleic acid molecule of Claim 1 
wherein the isolated nucleic acid molecule contains the 
nucleotide sequence shown in FIG. 6. 

25 3 . The isolated nucleic acid molecule of Claim 1 

wherein the isolated nucleic acid molecule contains the 
nucleotide sequence shown in FIG. 9. 

4. An isolated nucleic acid molecule which 
30 encodes a tub polypeptide having: 

(a) the amino acid sequence encoded by the cDNA 
contained in cDNA clone fumh019, CBT9H1 or 
CBT9H3, as deposited with the ATCC having 
accession No. 69856* 97222 or 69874, 
35 respectively, or the genomic DNA contained in 

clone P6, P8, or B13, as deposited with the 
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ATCC having accession No. 69857, 69858, or 
69859 respectively; 
(b) the amino acid sequence shown in FIG. 6, 
lacking amino acid residues 134-189; or 
5 (c) the amino acid sequence shown in FIG. 9, 

lacking the amino acid residues 134-189. 



5- The isolated nucleic acid molecule of 

Claim 4 wherein the isolated nucleic acid molecules contains 
10 (a) the nucleotide sequence shown in FIG. 6, 

lacking exon 5; or 
(b) the nucleotide sequence shown in FIG. 9, 
lacking exon 5. 

15 6. A nucleotide vector containing the nucleotide 

sequence of Claim 1 or 4 . 



7. An expression vector containing the nucleotide 
sequence of Claim 1 or 4 in operative, association with a 

20 nucleotide regulatory sequence that controls expression of 
the nucleotide sequence in a host cell. 

8. The expression vector of Claim 7, wherein said 
regulatory element is selected from the group consisting of 

25 the cytomegalovirus hCMV immediate early gene, the early or 
late promoters of SV40 adenovirus, the lac system, the trp 
system, the TAC system, the TRC system, the major operator 
and promoter regions of phage A, the control regions of fd 
coat protein, the promoter for 3-phosphoglycerate kinase, the 

30 promoters of acid phosphatase, and the promoters of the yeast 
or-mating factors . 

9. A genetically engineered host cell that 
contains the nucleotide sequence of Claim 1 or 4 . 

35 

10. A genetically engineered host cell that 
contains the nucleotide sequence of Claim 1 or 4 in operative 
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association with a nucleotide regulatory sequence that 
controls expression of the nucleotide sequence in the host 
cell- 

5 11. A method of preparing a protein containing: 

(a) the amino acid sequence shown in FIG. 6; 

(b) the amino acid sequence shown in FIG. 9; 

(c) the amino acid sequence encoded by the cDNA 
contained in cDNA clone fumhOlS, CBT9H1 or CBT9H3, 

10 as deposited with the ATCC having accession No. 

69856, 97222 or 69874, respectively, or the genomic 
DNA contained in clone P6, PS, or B13, as deposited 
with the ATCC having accession No. 6 9857, 69858, or 
69859 respectively; 
15 (d) the amino acid sequence shown in FIG. 6, lacking 

amino acid residues 134-189; or 
(e) the amino acid sequence shown in FIG. 9, lacking 
the amino acid residues 134-189, 
wherein the method comprises: culturing the genetically 
20 engineered host cell of Claim 10, so that the protein is 

expressed in recoverable amounts, and recovering the protein 
from the culture or host cell. 



12. An isolated polypeptide having the amino acid 
25 sequence shown in FIG. 6 or FIG. 9, or the amino acid 
sequence shown in FIG. 6 or FIG. 9, lacking amino acid 
residues 134-189, or the amino acid sequence encoded by the 
cDNA contained in cDNA clone fumh019, CBT9H1 or CBT9H3 , as 
deposited with the ATCC having accession No. 6 9856, 97222 or 
30 69874, respectively, or the genomic DNA contained in clone 
P6, P8, or B13 having accession No. 69857, 69858, or 69859, 
respectively. 



13 . An antibody that immunospecif ically binds the 
35 polypeptide of Claim 12. 
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14. A method for diagnosing body weight disorders, 
or a propensity for body weight disorders, in a mammal 
comprising measuring tub gene expression in a patient sample. 

5 15. The method of Claim 14 in which expression is 

measured by detecting mRNA transcripts of the tub aene . 

16 . The method of Claim 14 in which expression is 
measured by detecting the tub gene product. 

10 

17- A method for diagnosing body weight disorders, 
or a propensity for body weight disorders, in a mammal, 
comprising detecting a tub gene mutation contained in the 
genome of the mammal. 

15 

18. The method of Claim 17 in which the mutation 
is located in a splice site of the tub gene. 

19 . A method for screening compounds useful for 

20 the treatment of body weight disorders, comprising contacting 
a compound with a cultured cell that expresses the tub gene, 
and detecting a change in the expression of the tub gene by 
the cultured cell. 

25 20. The method of Claim 19 in which expression of 

the tub gene is detected by measuring mRNA transcripts of the 
tub gene. 

21. The method of Claim 19 in which expression of 
30 the tub gene is detected by measuring tub protein. 

22. A method for treating a body weight disorder, 
in a mammal comprising administering a compound to the mammal 
that modulates the expression of the tub gene in the mammal. 

35 

23. The method of Claim 22 in which expression of 
the tub gene is up-regulated. 
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24. The method of Claim 22 in which expression of 
the tub gene is down- regulated . 

25. The method of Claim 24 in which the compound 
5 provides an antisense or ribozyme molecule that blocks 

translation of tub mRNAs. 

26. The method of Claim 24 in which the compound 
provides a nucleic acid molecule that is complementary to the 

10 5' region of the tub gene and blocks transcription via triple 
helix formation. 

27. A method of treating a body weight disorder 
resulting from a mutation in the tub gene in a mammal, 

15 comprising supplying the mammal with a nucleic acid molecule 
that encodes the wild-type tub protein to exert a weight 
modulating effect. 

28. The method of Claim 27 in which hypothalamic 
20 cells that express the wild- type £ii]2 protein are introduced 

into the mammal . 

29. The method of Claim 27 in which the 
hypothalamic cells are engineered gx vivo to express the 

25 wild- type tub protein. 

30. The method of Claim 29 in which the 
hypothalamic cells are contained in a carrier. 

30 31. The method of Claim 27 in which a nucleic acid 

molecule encoding the wild-type tub protein, contained in a 
pharmaceutically acceptable carrier, is administered to the 
mammal . 

35 32. The method of Claim 31 in which the carrier is 

a DNA vector, a viral vector, liposome or lipofectin. 
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33. The method of Claim 31 in which the nucleic 
acid encoding the wild-type tub protein is introduced into 
the mammal's hypothalamus. 

5 



10 



15 



20 



25 



30 



35 
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> HTS K PHSOWZPYSV 14 

occacoo co ccoccoacQOQftaiL asg act toc aao'ooq ckd nc gac vqs jot- ocr nc Asr ozc 42 

JL;,?-.^ ^ ^ SNIi.RQQkLDRQRAI, L 34 

cm GAT GAT GAG OQC AOC AACCTOAOQCAOavOAAGCICGACOQSCAO GGO OOC CSO VTO 102 

K K R Q B P L H V Q A H A D O 54 

GAA GAG AAG GAG AAO AAG AAO OOC CAA GAG OOC VZO ASO OCA GAG OOC AASP OCA GAS OOA 162 

RPRSRRARQSBBQAPZiVS SY 74 

OGO OOC OQG AGT G9G CX2A OOC 009 GAG VGA GAG GAG GAA GOC OOC GIG 01X5 GAG teC lAC 222 

• X* S S S O S T. S y Q V Q B A D S X A S V '54 

GXC AGO AGO AGT 09C AGO AOC A0CTACCAAGXTCAAGA0O0CGAC7DG AST GOC AGT QSk 2B2 

QLGATRPPAPASAKKSKGAA U4 

GAG GXG OGA OOC AOC OGO OCA OCA GCA OCA OCT TCA GCC AAG AAA TOO AAO G8A GOG OCT 342 

ASGGQGGAPRKBKKGKBK G T 134 

GCA TCT QQQ OGC GAG OCT OGA GOC OCT AGO AAG GAG AAG AAG 'OGA AAG GAT AAA 09C ACC 402 

S G P A T i«AEPKSE AQGP*V QXZ. 154 

AOC OQG OCA GCA ACT CTG GCA GAA GAC AAG TCT GAG GCC GAA GOC OCA GTG CAG AXC TTO 462 

TV G Q S DHDKDAGETA.AGGGA 174 

ACT GIG GGA GAG TCA GAC CAC GAC AAG GAT GOG OGA: GAG ACA OCA GCC OOC GOG GOC GCA 522 

QPSGQDL RATKQRKGISfiSM 194 

GAG OCC AGT OGG GAG GAC CTC OOP OOC AOO ATG GAG AGG AAG GOC ATC TOO AOC AOC ATO 582 

SP D E' D E DfiDEHSSSSSQLNS ^214 

AGO TTT GAC GAG GAC GAG GAT GAG GAT GAA AAC AOC TOC AGC TOG TCC GAG CEA AAC AGC 642 

N T R P S S A . T • S R K S I R E A A S A' P 234 

AAC ACC COC OCT AGT TCT OOC ACT AGC AGA AAC TCC ATC COG GAG GCA GCT TCA OOC OCC 702 

SPAAPEP PVDIEVQDLE EFA 254 

AOC OCA GOC OOC OCA GAG CCA OCA GIG GAT ATT GAG GTC GAG GAT CIA GAG GAG TTT OCA 762 

P A P QGITIKCRITRDKK G 
CTS AGG CCA GCC CCA CAA GOG ATC AOC ATC AAA TOC OOC ATC ACT 03G GAC AAG AAG GOG 

HDRG MY PTYFLHLDREDG K 



ATG GAC OOC OGO ATC TAG COC ADC TAC TTT GIG GAC CPA GAC OOP GAG GAT OOC AAG AAG 



GTG TTC CTC CIG GCG OGC AGO^AAG AGA AAG AAC AGT AAA ACT T^C AAT TAC CIC ATC TCT 



274 
822 



R 294 
882 



F L L A G RKRK KSK, TSNY!..! S 314 



942 



PTDI.;*SROGDSYZGKLRSM 334 



1002 



GTG GAC OCA ACA GAC TTO TCP 000 OOA OOC GAT AOC JSAT ATC 000 AAA TTG OOQ teC AAC 

JL. ^ T K P TVYDHGVN-PQK.AS S 354 

cro ATO OGC AOC AAG TTC AOC OPT TAT .GAC AAT OOC GPC AAC OCT GAG AAO OCA TOO TCP 1062 

^ ^ ^ ^ ^ TLRQELAA. VCY E TH 374 

TOC ADC CTO GAA ADC OC3A AOC TTG OGC GAG GAG CTC OCA GOG GTS TOC TAT GAG ACA AAT 1122 

rX, JS: ^ ^ ^ ^ PRKMSV IV*. ppM NKV 394 

GTC CPA OOC TTC AAO GGA COT 000 AAG ATG AGT OPG ATC GTC OCA OOC ATG AAC ATC OTT 1182 
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HBRVC iRPflLHEHETLLHRWO 
CAT QM AGX CTC T5T ATC COC CCC CCC AAT 6A\ CAT QKG ACC CTC TTA CCA CCC «50 CAC 

NKKT B £ J iBI^QNK TPVWMa)© 
AAC AXG AXC ACC CAC ACC ATC ATT GAS CTC CAC AAC AAC ACQ CCA CTC TCC AAT CAT GAC 

TQ SVVl*KPKC RVTQASVKNr 
ACA CAC TCC TAT CTA CTT AAC TTC CAC CCC COT OTC ACA CAG OCT TCT CTC AAC A/iC TTC 

QI I H G NDPDyiV KQPCRVAE 
CAC ATC ATC CAC CCC AAT CAC CCC CAC TAG ATC OSC ATC CAC VTT CCC COG OTA GCA GAA 

r T M P VIJ V .PX.eALCAPAXA 
OAT ore TTC. ACC ATO CAT TAC AAC TAC OCA CTC TW CCA CTC CAC OCC TTC CCC ATT CCT 

I'S SPD SKI.A.Cr* 
CTC TCC ACC TTT GAC ACC AAC CTG GCc' TCC CAC TAC ACGCCCCCCACTGCCCTTAGCWOOC'^UBTC 

CCCAOTGCACCTTgCCTCCCTCCeAACACACCCCTCCCTA C CCTCTOrr a iTACCCCCTCTAT^ 



414 
1242 



494 

U02 

454 

474 
1422 

494 
1482 
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tubgenoxaxc 
B6erenomic 
tubcdna 
BGcdbia 

«1 

rubgenomic 
B6genQmic 
bubcdna 
B6cdna 

«41 

cubgenomlc 
B6genoznic 
tubcdna 
BGcdna 

«8i 

tubgenoxoic 
B6genomic 
tubcdna 
B6cdn:a 

#121 

tubgenoznic 
B6genomic 
tubcdna 
BGcdna 

ftl61 

tubgenozxiic 
B6genomic 
tubcdna 
B6cdna 

#201 

tubgenomic 
Begenomi'tif 
tubcdna 
B6cdna 

«24r 

tubgenoxoic 
BSgenoxnic 
ttabcdna 
B6cdna 

#281 



4^ 

iU:G6CAATGA C CTTGAGTC T TGQCACTCGC 'TOTrrTTGAT 

AcoeCAMGA rqszsaoaisT toccactccc totttttgat 

A06GCAM«A .CCTTGAGTGT TOCCACTCCC TOTTTTTGAT 
ACGGCAAT6A CC:t::s;:; :::::::::: zzrtzzzzzz 



GTTGTACGCA TGGTGCCCAG CCCCCACCCC ACOCCCA&TC 
GTTGZftdQCA T60T6CCCAG OCCCCACOCC ACCOCC&ASCC 
GXTOTiyOGCA .aX3GTBCCCAG COCOCftCCCC ACOOCCSUkSC 
s:8s-scs:st.ttitstt:ss ttxtsescss sssstttttt 



CQCTGAZCTG GICCATATai GCCAGTQ2m? GGSOGTQCSGT 
.CCCTQATCTG' GTCC&TATCA GCCAGXGATG OGa TC Xt SGCT 
OCCZQATCTG GTCCASATCA CCGAGTGATO G6ATGTOGGT 
::: s 8 : s s ::::: t : : : zi z .: z t iz zz 



AmiTCGCTTT TGTTAGA&CT TTCTAACTGT AQTGATCTAG 
ATATGGCTTT TGTAAGAACT TTCTAACTGT AGT6AIZCTAG 
ATATGGCTTT TGTTAGftACT TTCTAACTGT AGTGAZGXAG 
S8:::::tit s::::ss::: ;:::s:stss sssrtssssi 



.AGTCCTGCCC CTAGTGCCCT GCATGTCTGG GGCTTGGGaA 
AGXCCSGGCC CTAGTGCCCT GCATGTCTGG 6GCTT66G3UI 
AGTCCTGOCC CTAGTGCCCT GCATGTCTGG 6GCTTGGG3kA 

•••••••••• •••••••••• «>*S»»«;;« ••5SS85SSS 



.TACC CTTTA A ATGGAT6TCT 
TACCCTTTAA ATGGATGTCr 
TACCCTCTAA ATGGATGTCT 

SSSSSS5SS" •••••••••• 



TTTCTCTCCT GGGCCCTGCT 
TTTCTCTCCT GGGCCCTGCT 
TTTCTCTCCT GGGCCCTGCT 
:::::::::: ss;;::*::: 



GICTGTGTGC ATCTCCCCCC'TTCACCCTCT TGCTTCATAA 
GT CTgTG TGC ATCTCCCCCC TOCACCCTCT TGqTTCATAA 
GTQSDGTGTGC ATCTCCCCCC TTCACCCTCT TGCTTCATAA 
;:8't;s:t:; :s:::s;;:: :sss5:^x:: zzzzzzzzzi 



TGTTTCTCTT GAACCTTTGT TTTGTTCATC CTTTCGATCT 
' T GTTTC TCTT GAACCTTTGT TTTGTTCATC CTTTCQATCT 
TGTTTCTCTT GAACCTTTGT TTTGTTCATC CTTTCGATCT 
ssststsxtx :::sssss:: :x»:s::::: ttsxxssss: 
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fcubgenomic CTTTGGCACT TOTGCTITCT CCTTCCCTCT TCTGGCCC&T 

BSgenomic CTTTGGCATT TCTGCTTTCT CCTTCCCTCT TGTGGCCCHT 

txibcdna CTTTGGCATT TCTGCTTTCT.CCQPTCCCTCT TGXGGCOCAT 

BGcdna *s ::::::::: :::::::::: ssssrssss: 

#321 •wm.mmmmmm 

.tubgeaomic GTCTTACCTG* !gtcTCCCTGT CTCCACCATT CTTGCTTOTG 

BSgeaomic GTCTTACCTG GTCTCCCTGT CTCCACCATT CTTCCTTCTG 

tubcdna OTCTTACCTQ GTCTCCCTGT CTCCACCWPT CTTCCTTCTG 

B6cdna :ss::s:::: :::::::::: 

•«361 ********** 

tubgenoniic CATTCCACAG CGGACTACAT CGTCATGCAG TOWGGCC 

B6geaomic CATTCCACAG OGOACTACAT CGTCATGCAG TTTGGCC 

tubcdna CATTCCACAG CGGACTACAT CGTCATGCAG TTTCGCC 

B6cdna :::::::::: OGGACTAQAT CGTCATGCAG TTTGGCC 

#401 
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ASOOaKrGCriC'OCGGQGACX»TAOCmXX3QCX 3 G^ 

MTSKPHSDW 9 

OCOGGOCTCXSUGJUSCCGCAGOCa^ ATG ACT TCC AAG COG CAT TCC GAC TGG 27 

X PysVLDDEG RNLRQQK£.DR 29 

ATT COC TAC AGT GTC TTA GAT GAT GAG OGC AGA AAC CTG AOG CAG CA6 AAG CZT GAT COG 87 

QRALLEQKQKKKRQEPLKVQ 49 

CAG COG GCC CTG CTG GAG CAG AAG CAG AAG AAG AAG OGC CAG GAG COC CTG ATG GTC CAG 147 

ANAD GRPR SRRARQSEEQ A P 69 

OCC AAT GCA GAT GGG OGG COC COG AOC OGG COG GCC CGG CAG TCA GAG GAA CAA GCC CCC 207 

LVE S YLSSSGST SYQVQE AD 89 

CTG GTG GAG TCC TAC CTC AOC AGC AGT GGC AGC ACC AGC TAG CAA GTT CAA GAG OCC GAC 267 

SLASVQLGATRPTAPASAK R 109 

TCA CTC OCC AGT GTG CAG CTG GGA GCC ACG CGC CCA ACA GCA CCA OCT TCA GCC AAG AGA 327 

TKAAATAGGQGGAARKEKXG 129 

ACC AAG GCG OCA OCT ACA GCA GGG GGC CAG GOT GGC GCC OCT AGG AAG GAG AAG AAG GGA 387 

KHKGTSGPAALAEDKSEAQG 149 

AAG CAC AAA GGC ACC AGC GGG CCA GCA GCA CTG GCA GAA GAC AAG TCT GAG GCC CAA GGC 447 

PVQILTVGQSDHAQDAGETA 169 

CCA GTG CAG ATT CTG ACT GTG GGC CAG TCA GAC CAC GCC CAG GAC GCA GOG GAG ACG GCA 507 

AGGGERPSGODLRATMQRKG 189 

OCT GGT GGG GGC GAA CGG CCC AGC GGG GAG GAT CTC CGT GCC ACG ATG CAG AOG AAG GGC 567 

IS SSMSFDEDEEDEEENSSS 209 

ATC TCC AGC AGC ATG AGC TTT GAC GAG GAT GAG GAG GAT GAG GAG GAG AAT AGC TCC AGC 627 

SSQLNSN TRPSSATSRKSVR 229 

TCC TCC CAG CTA AAT ACT AAC ACC CGC CCC AGC TCT GCT ACT AGC AGG AAG TCC GTC AGG 687 

EA ASAPSP TAPEQPVDV'EVQ 249 

GAG GCA GCC TCA GCC CCT AGC CCA ACA GCT CCA GAG CAA CCA GTG GAC GTT GAG GTC CAG 747 

DI-EEFAIjRPAPQGITIKCRI 269 

GAT err gag gag TTT GCA CTG AGG cog GCC CCC CAG GGT ATC ACC ATC AAA TGC CGC ATC 807 

T K . K G. ...M D R G M Y P T Y F L H li D 289 

ACT OGG GAC AAG AAA GGG ATG GAC OGG GGC ATG TAC CCC ACC TAC TTT CTG CAC CTG GAC 867 

REDGKKVFLLAGRKRKKSKT 309 

CGT GAG GAT GOG AAG AAG GTG TTC CTC CTG GCG GGA AGG AAG AGA AAG AAG AGT AAA ACT 927 

SNYLISVDPTDLSRGGDSYI 329 

TCC AAT TAC CTC ATC TCT GTG GAC CCA ACA GAC TTG TCT CGA GGA GOG GAC AGC T5^T ATC 987 

GKI-RSNLM GTKFTVYDNGVN 349 

GGG AAA CTG CGG TCC AAC TTG ATG GGC AOC AAG TIC ACT GTT TAT GAC AAT GGA GTC AAC 1047 

PQKASSSTLESGTLRQELAA 369 

CCT CAG AAG GCC .TCA TCC TCC ACT TIG GAA AGT GGA ACC TTA CGT CAG GAG CTG GCA GCT 1107 

VCYETNVLGFKGPR'kMSVIV 389 

GTG TGC TAC GAG ACA AAC GTC TTA GGC TTC AAG GGG CCT CGG AAG ATC AGC GTC ATT GTC 116" 



wo 97/02048 



12/18 



PCT/US96/11186 



pgmnmvherv sirpr nehbt" ^ 

CCAGGCATGAACAaXSOTrcaTGRGABAGTCTCTMCCGC CTC.OOC AAC GAG C3k!mO 1227 

eiXS CTA CJCA CGC TOG CAO MT AIUBJ«CMOCMAOTATCJaCG»GCTOqjlJ«CJU«ACIl 1287 

PV WN D DTQSY VLN. FHGRV-TO AAa 

CCTGTCTOSAATGATGACACSlCASTCCTATGTAClCAACTTCattOOOCOCOlCJ^ 1347 

ASVKNFQIIHGNDPDYIVMO am 

0CCTCC0TCAA0AACTTCCiyBAaCATCCM0GCA»TGRCO0GO»C«CMCOrojmja^ 1407 

FO R V A EDVFTMDY NYP L C AI. 4B9 

ITT GOC 03G GTA OCai GAG GAT GTO TtC *0C ATG GW TAC AAC T»C OSS CIO TOT <KA CI6 1467 

Q^faialssf'dsklace* soe 

CAGOCX:TITO0CAaTG0CCrBTOC«QCTICGAC*0CAAOCIO0C»WCejUS«AO AtaoOCIC 1528 

wa aiUA.ciTxu» aTtoooca«XOTaAoo ^ ^^^^ 

TATA0«XTraXiCCatf8A!iq»»SC^^ ^ggg 
"^*°'^°'^°«»O0GaA0TOOI0GM 

°'^'i«^*«»a«a«?icccAcc^^ ^^^^ 

TICCAAGCTTGQGAAAOOGOOGCSUlAAAOQCRTTAGGAGaOGAG j^ggg 
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AGCCTACAGTTTAAACAGTOGJOCTAGACTTAATTAAGGNTCCGGNGCG 

CCCCCGGGTACCGAiXaxyrGGTCiaiCCCACTGCCrrGT^^ 

ATCTGGGGATCTTTCePGAfSeAG TTCAAGAGty^CGAC^^ 

TGCAGCTGGQAGCCACGCQCCa^CAGCACXaGCTTCAGCCAAGAG^^ 

AAGGCGGCAGCTACAGCAGGGGGCCAGGGCGGCGCCX3CTAGGAAGGAGAA 

GAAGGGAAAGCACAAAG OTCAGCTCAO^TTCTgrACAGCCCTaCCCAGCA 

GGCCTGGCCTCCACTXSTAGGGCTGGGGAAGGTTTGTCCTCCTGACTT^ 

GGGGAGGGATAGGATGAACAGCCTCAGGGAASACACAGACTGCCJICT^ 

GGCACCCCCTCAGGTGGCTCaiCAGGCCTCATCTAGCTTGGGAGGTGCCTC 

GGCTGCCTCTGGGTGTGGGCATGCCTACCAACACTGCXIAGGAAGT^^ 

CCTGCTCAGCTTTGGCCCAC3AA(X:ACCGTCCC1»N^ 

CCTTGAGGAA(XrrTTATNAT6ACX:CCimiAA^^ 

GGATT 



TTCAAGGGCCAAAGTTTTTTAATGATGTATGGGAGTTAATGAAGGNGGTA 

TGTGGGTITZGTTNGNGGAAGAAAACACCAGC^TTGATGGTTGTAGimSKT 

GGTGTCCAKGAATGATTGCTGGCCTK3CCTATGGTOTC 

TTNTCCCATCTTGTTTTTTCCCATGTGCAGTTGGTTTTTGTAGATGGCTG 

CCGTCTGCTTTAAAGGACGTGAGGTGTTGTAAACCyACCCTCGGCAATTA 

ATTTGGGGGAAGAGCASAAGAAATGAAGCCCAACATCCCTTACTAGCrrTA 

COlGTTGTTAACAGGCTGGTGCAATam^AGTTTTATAAAAAT^ 

GCAAATAAAGITTTGCAGAGGGTTTCCCCACTCTTCCCTCaiTCCCCTTCA 

TGGACGTCTGAGAATCa^GGCCCIXXrrC?lX:CTCCTCCTGGATG^ 

fysrRTflTPPnTCGggroeAfaty^AggAGgGGGCgAG CAGCACTGGCAGAAG 

ACAAGWCTGAGGCCCAAGQCCCAGTGCAGATTCTGACTGTGGGCCAGTCA 

GACCACGCCCAGGACGCAGGGGAGACGGCAGCTGGTGGGGGCGAACGGCC 

gAflrRflQgAffaATgTgPGTKgCAgGATGgAGAGGAAGGQTCAGCCCCATG 

GGGGCCXIAGTGATACCCCCAAAACnCAGTCCCAGGTTCTCAGATGCACCT 

TTCTCTOGGAGCATGGNCTTCCTGTGTCCAAACCCCTC^ 

GGGTGAGGGTGGGGCACACTTCGGAGACAAATKAGAAACTCTTAGGCAGG 

GNCCCTGCTAAGGCCCCAGGGAGGCC 
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TTAAACAGTCGACTCTAGACTTAATTAAGGATCCGGCGCGCCCCCGGGTA 

CCGAGCTCAGTGCAGGCCTTGATACACAAGAGACAGTGGTAGGGTGSCTG 

CTAGGTAGTGGGGTAATCTAGGGACTGAGCTGAAACTGGGTGGTGGGGAT 

ATATCCTGAGGATTGTGGCCAGCCCCGGCTCATGTGTGTACCTGAGAGAA 

TATCCTTTTATATCTGGACATGTGTGGGAATATATGTGTGAATGGGAGTC 

TATATGTGTAGATATGGCTAAGAGTGTGTGCATAAGTTTGTGGGGGTACA 

GGTGAGTCAGTGTCTGAACATGAGTATGTGACCATGTGTATTTCAGGGGC 

AGGGTAGACTTCTCCTCATTCATCCCTTCTTCTTCirrcCTT^ 

CATCTCCAGCAGCATGAGCTTTGACGAGGATGAGQAGGATGAGGAGGAGA 

ATAGCTCCAGCTCCTCCCAGCTAAATAGTAACACCCGCCCCAGCTCTGCT 

ACTAQCAGGAAQTCCGTCAGGG TGAGTGAGTGAGTCTGCATCCACAGCAG 

TTTTTGGAGGACTGCTCATCCGTTAGAGGTGGACTGCaiTGTGAAGAGATG 

GACTCGTATGCCTTTAGGAGCTTCTCTGCTGGCCTCTTACGTCCCTCTAC 

CTTGCCTCCTAACCTCTTCAGCTAGGCCAGCAGGGTGATGTATGGGGGGA 

GATGCAGTTGGACAGGATGACCTCTGAGGACCTCCCGTATCTCCCATCTC 

CACCTCTAGGAACTGTTGAGGGCAGGGCTGGGAAGATAGCTTCTGACCCC 

AGGCCCAGGCTGGCCAGGCCCCAATCCCAGGATCCTTCCCTCTCTCCCAC 

CGCCACGTTAGGAGGCAGATTTGGATCCCAGACCACCAATTTGGGCTGCT 

TAGGGTCCTTGGGGCTCAGGCACCTATTCTGCATCCCCATAGSAGGCAOQ 

CTCAGCCCCTAGCCCAACAGCTCCAQAGCAACCAG TnGACGTTGAGGTCC 

AGGATCTTGAGGAGTTTGCACTGAGGCCGCCCCCC CAGGGTATCACCATC 

AAATGCCGCATCACTCQaGACAAQAAAGGGATGG ACCGGGGCATGTACCC 

CACCTACTTTCTGCACCTGGACCGTGAGGATGGGAAGAAG GTAAGGTTGG 

TCTGGGCATGTTATCATCTAGGCTTTACAGCCCTTTGAAATCCTAGGGGC 

TGAAATGTGACTGGAAGTCTCATATCTACCGCTGACCTCTCAGTTCCTCA 

AAGAAACTGCCTTCGTGTCTGGTCTGTGCACATCTTTGTGTTTTCCAGTG 

CATTTGTGTGTGTGCACATATGTGCGTTTGGGAGCTGACGCAACGGAGAG 

AGTCTGTGTGAGTGGCTCTCATGACTGTGTGCAGACCAGAGGCTGAGTCT 

^'JA^Ti^T^^^^T'^'^ tvTv^n aoTT r rra afOTTnTTCrTgCTGGCGGGAAGG A 

AGAGAAAGAAGAGTAAAACTTCCAATTACCTCATCTCTGTGGA CCCAACA 

GACTTGTCTraAaflAC;nfy!ArAGCTATATCGGGA AACTGCGGGTACTAGC 

ATTCCCCCAGGAAGCAGGCGGGAGTGGGAGGGAGGGGCAGGGGCAAGCTG 

TCTGTAGAGGGCCTGAATCTTCCTGAAGGAGATCTAGGCCAGGGATGGAT 

ACTCTCCCAGGATCCTCTCTGATAATCACATCCAACTGGAGGCCTATGTC 

TATGCCAGCCTAGAGCCAGACTTGGAGATGGGACTCACACACCCGACCCC 

AAGCTGTTCCCAGGAGGTGGGTGCAGGCCCACCAAGAGTGATGGATCCAA 

CCCCAGGGTGTCACTGATAACGCAGGCCACCATGGAAGAGTTGCCTTGGC 

TCCATGGTCAATGCCAAGGGACAGGGCTGAGAGTGAGCTCGGTACCCGGG 

GGCGCKCCGGATCCTTAATTAAGTCTAGAGTCGACTGTTTAAG 
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GATTTAGNGGAACACAGCANCTTGNGGGTGGGANGGCAGTGGTGAAG^ 

CyvGGAAGGCTCTGAGCCTAGGCCTCCAGGTGGGGGCAGTGGGGAGGTA 

GTTTGCTGAGGAACTGAGTACCAGATTTGGGGAGCATAAATAAAGA*^ 

AGGTCAGGAGCTAAAGCTGGAGATGGGGCTGGACTGAGACTTAGGCTG^ 

TGCGACAGAGGAGATCTCATCCTCTCTCCACG^ 

CTGTCTTATCAGATGCCATTCTGTTTGCTCACCTC 

CCATGTTCCCCCAGATAAATCIYCTGAAGAATCCTGATTGACCTC 

ATTGCTCTCy^CTGAACTGAAATGCaiCT^^ 

AGGCCTTCTGCCCACGAAGTGTCTTCAAAGATGTGGATTCAGTGAG 

ATGCCTCCCTGGGCCTGCTCCTGTTCCAGCCCAGAATGTTTTGC^ 

CTCATAGGACAGACGATGAGCTGTTCCCTGCTTCTGGGGCAGAGGGT^ 

AAGTTCACTGTTTATGACAATGGAGTCAACCCTCAGAAGGCCTCATC 

CACTTTGGAAAGTGGAACCTTACGTCAGGAGCTGGCAGCTGTGTGCTAC ^ 

TGAGTCCTAGGTTCGGGGGTCTCTGATTTCCAAGGTAGATATGAAATCCA 

GGACTTGATGCCTGATCTAGGGGCTATCCCATCCATCTTAGTGGGTAGAC 

AAGGCTGTGTGGAGAGGGGCTGTCCTCTGTGGAGTGTTCCTGGCCTAGGA 

CAGGGGCTCTGGCTCTCTCCTCCTGACTTCA 
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AGTAGTTTGCCGGAYCGAAGTGGAAGAACARCATaCCCGTCAGCAGAACC 
AAGGATGACGCATAAGAGGAGCTAGTTCTGGCAGGGTAGAGACCCCAGGG 
GCTCAGTTCTGGCCCGTGTTAGGTTTAGAGGGATCTGTGTTAGACTTCGG 
AGTGGAGATGGTGGGAACTAGCTCTTCCTCTTTATTCCCGTCCCCCCCAC 
CTTCTCCAGTAGGTAAATAGACGCCTCAGGTGGCCAGTGTTCCGTTCTCT 
TTCCCAGGAGAGAA&rffT rTTAGGOTTrAAGGGVCCTCGGAAQATOAGPr; 

^GATTGTCCCAGG^^^^^^^^^^^^^^^^C^C 

SGSAASOTGAGTGTCTACCCCTTCCTCCCCTCTTTCCCCATCATCCTAGT 

CTCTGCATGAGCTTCTAAGGGCAGAACTCCAGCTGATGTGTATATGTGGA 

GGGGTACCATGTGAGAAAGCCCTGGAGGTCTAGGGAAATCCAAGGACCCC 

CATTCCCGGGATAGATCCCTTTCTGGGGTGGTCATGGTGCCAAAGGCCTC 

GGCCTGGCTCAGGTGAGGCTGCCCTCCCAG GAGCATGAGAgAgTQTTAnr 

ACGCTGGCAGAATAArtAA CACGGAGTGTATCATCGAGOTGCAAAACAAGA 

CACCTGTCTGGAATGAT GACACACAGTCCTATGTACTCAACTTCCATGflr; 

CGCGTCACAGAGGCCTCgCTGAAGAACTTC CAGATCATCgA-TOGCAATCA 

SCGTGAGTGTTTCTGTCCCTACTCATTATGGTCCGTAGGATACCCAAGGC 

CCTTAGCGTAGGGTTCAGCCCACCTAGCCCTGCCTACACTGGCTAGAGTT 

TAAGAATGTGAGCTATACAGCTAAGGTTAGATGTATGGAACTTTCTAACC 

CTAATGACTGGGAGGTCCTGGAAGAACCTTCTTTGSAGCCCTCGTCCTAG 

ATTCTGTGTATTCAACGGAGTCTCAGGCACGGGAACACCCTTTAAAAGGA 

CTTTTCCTCTTTTCTGTCCCCTGGTGTTCACATGCATCTTACTTTGTCCT 

TTGSCATCTGCCACCTCTTTCCTGCCACTTCTCCCAATTGGCCTTTGTTT 

TACTTCCCTTTGTGATTCCCCTGGCATCTCTGCTTCTCACTTGTTCTTCC 

CTCATGTGGTTTGGGTGTCTGTCTATCCTTCCCTGGCTCTACCATTCCTG 

TCCTGTCCTTTTCTCTGTCTGTGCCTGTGCTTGGCCCCA GCGGAgTArAT 

CGTGATGCAGTTTQGCCGr^, TAGCAGAGGATGTGTTCACCA'rGG^^ 

ACTACCCGCTGTGTGrArTGgAnnrrqv rTGCgA'PrGgCCTGTCgAGCTTC 

GACAGCAAGCTGGgGTPy'GAGTAGACy^ CCTCTTgGTGCCCTTTGGQGTTC; 

GCGAGCCTGGAGgGGAPi rTTGGgTGggTGCCTGTGGAGACAGgGgTGggT 

ATCCTCTGTATATAGGg gTTggGggAGATGAAGgTTTGGCCgTgAGTGr;r; 

CTCCCTGGCCCAGggA GGgAGGAAgTGGgTGgTTTGgGTgTGCTAGTGAG 

TTCTTTCCATGCCACGAGATCC 
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VIKNSNQKGKAKGKGKKKAK 20 

GTS ATA AAG AAC AGC AAT CAA AAG GGC AAA GCC AAA OGA AAA OOC AAA AAG AAA OCG AAG 60 ' 

EERAPSPPVEVDEPREFVLR 40 

GAG GAG AGG GCC CCG TCT COC OCC GTTG GftG GIG GAC GAA CCC OGG GAG TIT GIG CTC OOG 120 

PAPQGRTV RCRLTRDKKGMD 60 

OCT GCC CCC CAG GGC CGC AOG GT6 OGC 1GC OGG CTG ACC OGG GAC AAA AAG GGC ATC GAT IBO 

RGMYPSYFLHLDTEKKVFLL 80 

OGA GGC ATG TAT COC TOC TAC TIC CTG CAC CTG GAC ACG GAG AAG AAG GIG TIC CIC TTG 240 

AGRKRKRSKTANYLISIDPT 100 

OCT OGC AGG AAA CQA AAA COO AGC AAG ACA OCC AAT TAC CTC ATC TCC ATC OAC OCT ACC 300 

NLSRGGENFIGKLRSNLLGN 120 

AAT CTG TCC CGA OGA OGG GAG AAT TIC ATC OGG AAG CTG AGG TCC AAC CIC CTG GOG AAC 360 

RFTVFDNGQNPQRGYSTNVA 140 

CGC TTC ACG GTC TTT GAC AAC GGG CAG AAC CCA CAG OGT OGG TAC AGO ACT AAT GIG GCA 420 

SLRQELAAVIYETNVLGFRG 160 

AGC CTT CGG CAG GAG CTG GCA GOT GTG ATC TAT GAA ACC AAC GTG CTG OGC TTC OGT OGC 4B0 

PRRMTVIIPGMSAENERVPI 180 

CCC CGG OGC ATG ACC CHC ATC ATT COT OGC ATG AGT GOG GAG AAC GAG AOG GTC CCC ATC 540 

RPRNASDGLLVRWQNKTLES 200 

CGG CCC CGA AAT OCT AGT GAC GGC CTG CTG GTG OGC TGG CAG AAC AAG ACG CTG GAG AGC 600 

LIELHKKPPVWNDDSGSYTL 220 

CTC ATA GAA CTG CAC AAC AAG CCA CCT GTC TOG AAC GAT GAC AGT OGC TCC TAC ACC CTC 660 

NFQGRVTQASVKNF OIVHAD 240 

AAC TTC CAA GGC CGG GTC ACC CAG GCC TCA GTC AAG AAC TTC CAG ATT GTC CAC OCT GAT 720 

DPDYIVLQFGRVAEDAFTLD 260 

GAC CCC GAC TAT ATC GTG CTG CAG TIC OGC OGC GTG OCG GAG GAC OOC TTC ACC CTA GAC 780 

YRYPLCALQAFAIALSSFDG 280 

TAC OGG TAC COG CTG TGC OCC CTG CAG GCC TTC GCC ATC GCC CTC TCC AGT TTC GAC OGG 840 

r L A C E • 285 

AAG CTG GCC TGC GAG TGA CCOCAOCAGCCCCTCAGCGCCCCCAGAGCCCGTCAGCGTGGG 900 

GGAAAGGATTCAGTGGAGGCTGGCAOGOTCCCTCC A GCAAAG C ICCCGCGGAAAACTG^ 960 

CCIOTGICGGCX X rX U ACCTCTCACTG C CT C ICGGT ^ ^ 1020 

CACAGGCCGAGGCAOGAOGAGCCaXSACGGCGGGTAGGACGGAGATGAAGAACATCT^ 1080 

GTTGGAGCOGCACAT C TGGTClCGGAG C TC GC CT GO GCC GC TOTg C CCCCCTCCTCC^ 1140 

CGOGACCGOXrrAOGGCTCCGAGGAGCraXXXSSGGCTAGGAGGW 1260 

^ GGGACGAGQGCCAGGCACCCACATCCCCAATAAAGCXA i CU'lC^^^^ 1320 

AAAAAAAAAAAAAAAAAA 1338 
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